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DECLARATION OF C. NEAL STEWART UNDER 37 CFR § 1.132 

Dear Sir: 

1, C. NEAL STEWART, Jr., hereby declare that 

I L 1 reCdved a Ph " D - ***** fr ™ Virginia Polytechnic Institute and State 
University in December, 1993. 

2. I was an Associate pr0fess0r of BiQlogyj Uniyersily of North CarQlina 
Greensboro, 1995-2002. 

"3. I am currently a Professor, Department of Plant Sciences, University of 
Tennessee and recipient of the Racheff Chair of Excellence in Plant Molecular Genetics. 

.4. I am a co-inventor of the above-identified patent application. 
5. This declaration is submitted to demonstrate that the disclosure of the 
present application would have enabled a skilled scientist to prepare additional nucleic acid 
constats hav^^^ 

inhibitor and to use such constructs to confer insect resistance to plants by transforming 
plants w,th such construe*. Furthermore, this declaration is submitted to illustrate that the 
chemical and structural profiles of the Kunitz-type serine proteinase inhibitors are so well- 
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characterized in the literature that a skilled scientist knows what essential structural and 
chemical traits adhere to a protein identified as a member of the Kunitz-type serine proteinase 
inhibitor family. 

! 6. Proteins that function as proteinase inhibitors are exceptional among 
proteins, because they tend to retain their inhibitory activity even when the active site residue 
("Pi") is replaced by another residue (Laskowski et al., "Protein Inhibitors of Proteinases," 
Ann. Rev. Biochem 49:593-626 (1980), at 594, last para.) ("Laskowski") (attached hereto as 
Exhibit 1). Unlike other proteins, in which the replacement of active site residues leads to a 
complete loss or dramatic decrease in activity, proteinase inhibitors have the ability to 
tolerate a synthetic or mutational replacement of the Pi residue (Id. at 607, 1st full para.). In 
some cases, such a substitution leads to a predictable change in inhibitory activity, e.g., an 
Arg63 to Trp63 substitution at the active site in the archetypal Kunitz soybean trypsin 
inhibitor ("STI") leads to the conversion of a trypsin inhibitor to a chymotrypsin inhibitor (Id. 
at 594, last para.) Thus, within each inhibitor family, the Pi residue is not conserved, but 
changes frequently, resulting in inhibitory specificity, but not in a loss of biological activity 
(Id.). For example, it has been shown that generally, inhibitors with Pi Lys and Arg tend to 
inhibit trypsin and trypsin-like enzymes, those with Pi Tyr, Phe, Trp, Leu, and Met inhibit 
chymotrypsin and chymotrypsin-like enzymes, and those with Pi Ala and Ser inhibit-elastase- 
like enzymes (Id. at 606, 1st full para.). Because "serine proteinase inhibitor" encompasses 
inhibitors of trypsin, chymotrypsin, and elastase (see Example 9), a mutation or substitution 
in even the active site residue of a serine proteinase inhibitor can still result in a protein with 
serine proteinase inhibitory activity (Laskowski at 606, 1st full para.). 

7. Therefore, a nucleic acid molecule that hybridizes to a nucleic acid 
molecule having the nucleotide sequence of SEQ ID NO: 1 of my present application under 
stringent conditions may encode a protein with some variation in amino acid sequence from 
the protein encoded by SEQ ID NO: 1 while still retaining functionality as a serine proteinase 
inhibitor. A skilled scientist would not expect substantial variation among species 
encompassed by the claimed invention because of the highly stringent conditions set forth in 
the claims, some, but not much, variation would be expected in either the nucleotide 
sequence or the protein it encodes. 

8. Furthermore, my patent application teaches with particularity how to 
make and use a nucleic acid molecule that hybridizes to the nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO: 1 under stringent conditions. 
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; 9. Specifically, my patent application teaches the isolation of a PI gene 
from plant material and the preparation of a cDNA library having a PI clone (Example 1), the 
preparation of a shuttle vector containing the PI gene (Example 2), the transfer of the gene 
into a plant expression vector construct and subsequent transformation into Agrobacterium 
tumefaciens for use in transforming plant cultures (Example 3), and the characterization of 
the bopi gene, including the open reading frame of 642 bp, which encodes the mature BOPI 
protein (Example 4). The BoPI protein is highly characterized in the present application at 
Example 5, The amino acid sequence of the protein is disclosed, a putative Kunitz inhibitor 
family amino acid signature is identified, the Arginine active site residue at position 63 is 
disclosed, and the four cysteines of the expected disulfide bonds are identified (pg. 23, lines 
5-28). Furthermore, the BoPI protein is characterized as having a molecular weight of 21 
kDa and an isoelectric point of 4.94 (pg. 8, lines 25-29). 

10. In addition, Example 6 teaches that the PI gene isolated from Brassica 
oleracea is one of a family of PI genes, thereby teaching that additional PI genes may be 
isolated from Brassica oleracea. Examples 7-10 teach how to make heterologous transgenic 
plants having a PI gene of the present invention, and provide methods for determining the 
specificity and efficacy of the proteinase inhibitor's insect antibiosis activity when a serine 
proteinase inhibitor is expressed in a transgenic plant. 

11. Therefore, it is clear that a skilled scientist having read the present 
application would know: (1) how to make additional nucleic acid constructs having one or 
more operatively linked nucleic acid molecules which encode a Kunitz-type serine proteinase 
inhibitor isolated from Brassica oleracea having insect antibiosis activity, an operably linked 
heterologous DNA promoter, and an operably linked 3' regulatory region; (2) how to use 
such constructs to prepare expression vectors and host cells, including plant cells; and (3) 
how to prepare transgenic plants transformed with the construct that are resistant to insects. 

1 2. Protease inhibitors ("Pis") from plants have been studied for over 50 
years (Song et al., "Kunitz-Type Soybean Trypsin Inhibitor Revisited: Refined Structure of 
its Complex with Porcine Trypsin Reveals an Insight into the Interaction Between a 
Homologous Inhibitor From Erythrina caffra and a Tissue-Type Plasminogen Activator," J. 
Mol Biol 275:347:363 (1998) ("Song") (pg. 347 1st para, and pg. 348, 2nd full para.) 
(attached hereto as Exhibit 2). Pis are categorized into families, based on analogy (similar 
function), homology (similar amino acid structure), and mechanism of inhibiting proteinase 
activity (see generally , Laskowski). It is known that most individual protein inhibitors inhibit 
proteinases.belonging to a single mechanistic class (Laskowski at 593, 1st para.). Of these, 



Sefi 26 04 08: 13a PLANT SCIENCES 



8659741347 



-4- 

the inhibitors of serine proteinases are the most studied (Id.) and, thus, are the most highly 
characterized Qd. at 601, 1st full para.). Sequencing and X-ray crystallography have shown 
that these inhibitors are not all homologous; rather they belong to about 10 homologous 
families (Id. at 601, 2nd full para.). As disclosed in my patent application, the proteinase 
inhibitors of the present invention belong to the Kunitz family of proteinase inhibitors (see 
Example 5). Kunitz-type proteins are characterized as having a molecular weight (M t ) of 
about 21,000-22,000 Daltons, two-disulfide bonds (4-half-cystine residues) and a single 
reactive site for serine proteases (Terada et al,, "Amino Acid Sequences of Kunitz Family 
Subtilisin Inhibitors from Seeds of Canavalia lineata? J. Biochern. 115:397-404 (1994), first 
full para.) (attached hereto as Exhibit 3). 

| 13. Another characteristic which is used to establish PI families are the 
topological relationships between the disulfide bonds (also known as disulfide "bridges") and 
the location of the PFs reactive site (Laskowski at 610, 2nd full para.). In each family, the 
positions of all intrachain disulfide bridges are completely conserved (Id-)- Furthermore, 
recent X-ray crystallographic studies of the archetypal member of the Kunitz family, soybean 
trypsin inhibitor ("STT), show that Kunitz-type inhibitors from a variety of plant sources 
share a high degree of homology to the three dimensional structure of SH (Song at pg. 347, 
1st para, and pg. 348, 2nd full para.). Using the studies based on the structure of SIT, it can 
be predicted that the reactive site of the serine proteinase inhibitors of the present invention is 
positioned on an exposed loop of a characteristic canonical confirmation, unconstrained by 
secondary structural elements or disulfide bridges that could limit its conformational freedom 
Qd. at 354, 1st full para.). It has been reported that in Kunitz-type mono- and dicotyledons 
there is clear alignment of the Pi sites on the loop whether for trypsin, chymotrypsin or 
subtilisin (Peng et al., "Comparison of Amino Acid Sequences of the Trypsin Inhibitor From 
Taro (Colocasia esculenta), Giant Taro (Alocasia macrorrhiza) and Giant Swamp Taro 
(Cyrtospenna chamissonis" Biochern, Mol Biol Infl 31:73-81 (1993) at 80, 1st full para.) 
(attached hereto as Exhibit 4). In addition, proteinase inhibitors from Brassica oleracea have 
been characterized as having molecular weights ranging from 9-25 kDa, isoelectric points 
from 4.5-5.0, and trypsin and chymotiypsin activity that is relatively stable over a range of 
temperature from 0-100°C and at pH values of 4.5-7.5 (Broadway, KM., "Purification and 
Partial Characterization of Trypsin/Chymotrypsin Inhibitors from Cabbage Foliage," 
Phytochemistry 33:21-27 (1993), at the Abstract ("Broadway I") (attached hereto as Exhibit 
5). 
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i 14. I am familiar with WO 91/09060 to Broadway ("Broadway IF) cited 
by the PTO. Broadway II suggests that the efficacy of a specific inhibitor from an individual 
plant is dependent upon 1) the unique structure of the plant proteinase inhibitor, and 2) the 
susceptibility of the proteinase in the target organism (see page 2, lines 19-24). While the 
efficacy of an inhibitor may be dependent, in part, on the inhibitor's unique structure, one can 
utilize an inhibitor against a susceptible target organism (i.e., a herbivorous insect) without 
knowledge of the inhibitor's structure because the inhibitor exhibits its functional activity 
against a susceptible target whether the user is cognizant of the structure of the inhibitor or 
not. For example, Broadway II used three Pis isolated from cabbage to successfully inhibit 
the growth of herbivorous insects, knowing nothing more about the proteins than their 
molecular weight, isoelectric point, and enzymatic inhibitory activity (see Examples Mil). It 
was the knowledge of the specific activity of a proteinase inhibitor that drove Broadway's 
choice to use a particular PI against a given insect A skilled scientist would not consider it 
undue experimentation to test the specific activity of a serine PI. This can be done by 
carrying out an enzyme inhibition assay against trypsin, chymotrypsin, or elastase, such as 
described in the present application (see Example 9) or as disclosed in art (e.g., see Broadway 
I, starting at pg. 24, right col., second full para., to pg. 26 end of 1 st partial para.), or by using 
a bioassay (see Example 10). Once the specificity of an isolated molecule is identified, the 
molecule can be used as taught in my patent application to confer insect resistance to plants. 
Thus, a gene, including one encoding a serine proteinase inhibitor, and the protein it encodes, 
can be used successfully for its intended purpose without a complete understanding of the 
structural character of either the nucleic acid molecule or the protein. 

1 5. I hereby declare that all statements made herein of my own knowledge 
are true; and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under 
Section 1001 of Title 18 of the United States Code and that any such willful false statements 
may jeopardize the validity of the application or any patent issued thereon. 
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PERSPECTIVES AND SUMMARY 




Protein inhibitors of proteinases are ubiquitous. They are present in multi- 
ple forms in numerous tissues of animals and plants as well as in microor- 
ganisms. Their gross physiological function is the prevention of unwanted 
proteolysis, but detailed physiological function has been only rarely eluci- 
dated. Withrthe exception of m^oglobulim, which 
ctfaU classes* ii^ 




to a single mecha^ 
^serine proteinases.^ 
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teasubstratei^ How- 
ever, while kat/Kn for this interaction is large, both k^ and K m are, at 
neutral pH, very small 

In the stable enzyme-inhibitor complex the reactive site peptide bond of 
the inhibitor is still intact, but the carbonyl carbon is no longer trigonal. It 
is appreciably pyramidalized by the interaction of its oxygen with the 
"oxyanion hole" of the enzyme (not, as believed earlier, by the formation 
of a fourth bond with 0? oxygen of the catalytic seryl residue of the 
enzyme). The conformation of residues surrounding the reactive site pep- 
tide bond is that of an optimal substrate. Reactive sites are very rigid. Upon 
enzyme-inhibitor interaction conformational changes in either partner are 
minimal; it is almost a classical lock and key interaction. 

The known inhibitors are not all homologous but consist of several 
families, which must have arisen by convergent rather than divergent evolu- 
tion. 

A striking characteristic of inhibitors is the presence of several homolo- 
gous reactive sites on the same polypeptide chain, which arise from gene 
elongation by repeated duplication. In the case of the Bowman-Birk inhibi- 
tor family this gives rise to two homology regions, which are interconnected 
by interhomology region disulfide bridges. In many other cases (ovomu- 
coids, ovoinhibitors-protemase inhibitors from avian serum, dog subman- 
dibular inhibitor, inter-a-trypsin inhibitors from mammalian serum) the 
gene doubling gives rise to several (two, three, or six) tandem domains with 
all disulfide bridges intradomain and only short connecting peptides con- 
necting the domains. Chelonianin from turtle egg white consists of two 
tandem domains on the same polypeptide chain, but the two domains | 
belong to two different inhibitor families. i 

TKe'mliMtorSWex^ 

i(?i)ty ( anote 
«abk«change m^ 

*ttyjJsitt}inMn^ j 
*also:o«TOduring^ ! 

«i®R*idujys^not^^^ j 
^changtt^infcb^ J 
<*ov«nucwds)this,ra^ 

^-nyperyari^ 
^contraststostheieygl^^^ 
^iiser^rHowcvej^ 

^eW^to^Wospofcactivity. ' i!fii 
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PROTEIN INHIBITORS OF PROTEINASES 595 
INTRODUCTION 

Protein inhibitors of proteinases have not been reviewed here before. How- 
ever, a book (1) and numerous reviews on this topic (2-13) have appeared 
elsewhere. Two international conferences on inhibitors were held (14, 15), 
and inhibitors were the main topic of the 23rd meeting on Protides of 
Biological Fluids (16). Extensive discussions of inhibitors are included in 
many books and symposia on the biological role of proteinases (17-22) The 
literature of this field is huge, with about 500 titles appearing per year. 
Limitations of our competence and of space led us to deemphasize in this 
review descriptions of partially characterized inhibitors, and of extensive 
studies on the physiological role and possible medical applications of pro- 
lan inhibitors. We are also largely omitting the extensive and highly contro- 
versial literature on the mechanism of action of large inhibitors in 
mammalian blood. Because the mechanism of action of small inhibitors of 
serine proteinases is by now rather well known, we focus instead on the 
newer, extensive information on amino acid sequences of a large number of 
inhibitors. 

While protein inhibitors of enzymes other than proteinases are well 
known [e.g. inhibitors of a-amylases (23), deoxyribonuclease I (24), phos- 
pnohpase A (25), and protein kinases (26)], such inhibitors seem relatively 
rare compared to protein proteinase inhibitors. The reason for this seems 
easy to guess at proteins are substrates for proteinases. All of the presently 
accepted and proposed mechanisms of action of protein proteinase inhibi- 
tors involve either a requirement for proteolysis or for action as a substrate 
analogue. 

Limited Knowledge of Physiological Function 
The physiological function of inhibitors in the broad sense is clear-elimi- 
nation of unwanted proteolysis. However, detailed functions are not clear 
It is generally agreed that the presence of secretory trypsin inhibitors (Ka- 
zal) in the pancreas of vertebrates prevents premature activation of tryp- 
sinogen and, in turn, of other pancreatic zymogens. The predisposition to 
emphysema found in individuals with genetically determined low levels of 
arprotemase inhibitor strongly suggests protection against excessive pro- 
tajjw ' of lung tissues, but, while leukocytic elastase is the frequency 
Proposed target enzyme for a r proteinase inhibitor, that assignment is al- 
ready somewhat controversial (27). It is generally agreed that fSe presence 
of large numbers of proteinase inhibitors in massive amounts in mammalian 
plasma serves to delimit blood clotting (28) and other proteolytic cascade 
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596 LASKOWSKI & KATO 

processes, such as hormone production and elimination and complement 
fixation (29). However, the precise physiological involvement of inhibitors 
in these processes is not known. 

A good case has been made that many proteinase inhibitors in plants 
serve a defensive function against insect infestation by inhibition of insect 
proteinases. Ryan demonstrated that wounding of potato leaves leads to 
great mcreases in the level of a polysaccharide hormone, proteinase inhibi- 

| < f ?T which m ^ to hu «e tocreasestoim^itor 
levels (30 31). Many other exciting studies on the role of inhibitors are 
reported but m most cases the results, while suggestive, are far fiom un- 
equivocal In some cases, added research complicates rather than supports 
the naive original assignments. Thus an important role was at first envisaged 
for acroan inhibitors in the process of sperm capacitation, but more recent 
work casts doubt on this conclusion (32). 1 

iJ?. 0 *" ******* * itUa * on h ** worse » mainly because proteinase 
inhibitors are most frequently discovered and isolated by noting their ability 
to inhibit some readily available proteinase, e.g. bovine trypsin, rather than 
physiological target enzyme. They thus become proteins in search of a 
funcbon, rather than proteins isolated to account for a previously discov- 
ered biolopcai function. Because most laboratories used only bovine trypsin 
as the iwtoal test enzyme, an opinion formed that most proteinase inhibit 
are inhibitors of trypsin. As chymotrypdn, elastase, and subtilisin weS 

Nc^ZT^' °T y bUh,ton ° fth ™ ™ discovert 

kSS^L *^?i k ^ tob ? ^^^vmes for most of the isolated 

SSSS?" ^ ° f k ?7 M,e * **** en ^ es for the majority 
of inhibitors is now one of the major stumbling blocks in the understanding 

22^ ^°^ Pr ° Wem ******t solvit intracellular 
proteinases and for their cognate inhibitors m 

Nomenclature Problems 

In a review published nine years ago we deplored the confusion in inhibitor 
^clature(2) Since then a great deal aLt mhibitors hasZiT^ 

JT ZTtV ° eW di$COvered » the confusion^ws 

worse Newly discovered inhibitors are usually named accord^ totTe 

pSfl-Both procedures cause confusion: tte bfolojical sourcebecau* 3b! 

^«^« to more than Mefaauly ^ta^JEESS££ 

& such cases we i. ve susgeMed rae of ^ ^ 
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discoverer as a distinguishing method (e,g- soybean trypsin inhibitor (Ku- 
nitz) and soybean trypsin inhibitor (Bowman-Birk); pancreatic trypsin in- 
hibitor (Kunitz) and pancreatic secretory inhibitor (Kazal)]. However, this 
is not sufficient In soybeans there exist many inhibitors of the Bowman- 
Birk family and these are now simply designated by letters and numbers 
.(37). The Kunitz inhibitor from pancreas is not organ-specific but occurs 
in many bovine organs and most recently has been assigned to mast cells 
(38). The recent finding that the inhibitor in chicken plasma and chicken 
ovoinhibitor probably have the same amino acid sequence poses a serious 
problem (39, 40), The two inhibitors should probably be assigned the same 
name, but the well-established name, ovoinhibitor (41), seems grossly inap- 
propriate for a plasma inhibitor. Naming after biological source clearly 
produces confusion and occasionally leads to the wasted effort of character- 
izing the same inhibitor several times. Even greater problems stem from the 
use of the inhibited enzyme as a part of the name. First, the discoverer 
usually does not know the true target enzyme or the enzyme that is most 
strongly inhibited- Thus a r proteinase inhibitor was first named ^-anti- 
trypsin, when in feet many workers suspect that trypsin is not an important 
target enzyme for this inhibitor nor, among the many enzymes it inhibits, 
is trypsin most rapidly or strongly inhibited (42). Yet, because of the name, 
most studies of enzyme interactions with this inhibitor focus on trypsin* The 
suggestion to change the name to arproteinase inhibitor seems to have only 
limited acceptance. Another objection to specifying the inhibited enzyme as 
part of the name of the inhibitor is the recently discovered phenomenon of 
reactive site variability (37, 43-45). £fcequep%^ 

seas^#§©^^ 
f^tiypa^ 

inhibitor, (s.g* chicken and bobwhite quid! ovomucoids)^ 

Another strategy, which seemed superior, was to assign to inhibitors 
totally trivial names such as Trasylol®, Bayer's trademark for pancreatic 
trypsin inhibitor (Kunitz), or ones that indicate only the source and in- 
hibition but do not specify the enzyme, such as ovomucoid and ovoin- 
hibitor. 

We suggest that a rational system of nomenclature should involve first 
assigning the inhibitor to a family, then specifying the number of domains 
(if more than one) and the species from which it is isolated. The additional 
designators, such as its tissue source and the enzyme inhibited should be 
added last, since they are most likely to be confusing. However, as the list 
of families is still quite preliminary and since assignment to a family requires 
sequencing, introduction of the new nomenclature is still premature. 
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598 LASKOWSKI & KATO 
a 2 MACROGLOBULINS 

The term protein inhibitor of proteinases could in principle denote any 
protein that diminishes the enzymatic activity of a proteinase. The most 
bizarre examples, such as proteinases that Inhibit" other proteinases by 
digesting them, fortunately are not called proteinase inhibitors. However 
two entirely Dissimilar groups of proteins are called proteinase inhibitors' 
One of them consists of the macroglobulins, high-molecular-weight proteins 
apparently present in the plasma of all mammals, often in multiple forms. 
They are exemplified by a 2 human macroglobulin, often abbreviated as 
a 2 M, which has a molecular weight of 720,000 and consists of four appar- 
ently identical polypeptide chains. It dissociates into pairs of chains upon 
denaturation and into single chains upon reduction of disulfide bridges (46, 

When ajM combines with proteinases, only the proteolytic activity to- 
ward large protein substrates is decreased or eliminated. The activity to- 
ZS? 3S?° burSt substrates snail synthetic substrates is unimpeded 
(48) Strikingly, the oaM-proteinase complexes can still be inhibited by 
small protein proteinase inhibitors, such as pancreatic trypsin inhibitor 
(Kunitz) or pancreatic secretory trypsin inhibitor (Kazal), but not by larger 
ones such as soybean trypsin inhibitor (Kunitz) (49). Clearly, in the a 2 M 
complexes, the active site of the proteinase is quite open; it is the access of 
substrates and inhibitors to this site that is stoically hindered. The other 
remarkable feature of macroglobulins is their ability to bind a wide variety 
of proteinases belonging to all four mechanistic classes (50). Indeed, it is 
relativdy rare to find endoproteinases that do not form complexes with 
macrogtobuhns. The above facts gave rise to the "trap" mechanism of 
macroglobuliBaction (51). The proteinase hydrolyzes one or more particu- 
f"* y SG3c *v»«e peptide bonds in a 2 M, and triggers a conformational 
cbange m ojM which traps the enzyme molecule. Only molecules of very 
mgh molecular weight can act by this mechanism. To account for the broad 
jecificity the critical peptide segment must be rather long since it must 
contain peptide bonds matching the specificities of tibe various proteinases, 
formal 5pedfic P* 0 * 0 ^ cleavages upon complex 

rW .fif^S *° ™ d ™ MtT ° °° mpaTes *" importance of a 2 M with 
that of other inhibitors present in blood. Upon addition of a small quantity 
of enzyme to a mixture of OjM and another inhibitor, complexes with both 
nhibitors are formed in a ratio that depends upon the relative values of the 

S ^S/*^*^* f ° r association - However, the complex with a 
typical inhibitor dissociates, albeit slowly, while the a 2 M complex is "irre- 
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PROTEIN INHIBITORS OF PROTEINASES 599 

versible" and does not The result in an in vitro experiment is that all of 
the proteinase is ultimately transferred to an a 2 M complex, unless there is 
so much enzyme present that c^M is saturated. In vivo this effect is greatly 
exaggerated, since o^M proteinase complexes are cleared very rapidly; the 
half-life is about 10 min in man (52), Thus, essentially all injected protei- 
nases are cleared by the ct^M pathway. Other serine proteinase inhibitors 
are only transiently involved in transferring the proteinases to be cleared. 
There is considerable interest in understanding the signals leading to the 
more rapid clearance of complexed as opposed to free c^M lie view 
presented above is an idealization and probably does not apply to all serine 
proteinases* 

The research on macroglobulins clearly shows that they are of central 
importance in processing of proteinases in blood However, their interaction 
with proteinases is so different from that of other protein proteinase inhibi- 
tors that at least to chemists their designation as inhibitors seems undesir- 
able. 

INHIBITORS WITH CLASS-SPECIFIC REACTIVE 
SITES " 

The remaining protein proteinase inhibitors differ strikingly from the mac- 
roglobulins. For an overwhelming majority of them the following three 
statements are justified: 

^In^thef^^ 

substrates areitotaUy ab^ 

2. The inhibition is strictly competitive. 

particular inhibitory reactive site^<^ 
ing^^singte one ofette four 

AH of these statements axe frequently contradicted in the literature, mainly 
because of experimental errors, in our opinion (see 2). We are not aware of 
any well-characterized system that contradicts any of these three state- 
ments. If point 3 is granted then it is most convenient to divide the inhibitors 
into four major classes* 

Inhibitors of Carboxyl Metatto and Sulfhydryl Proteinases 
Many inhibitors of these proteinases have been reported, some of which are 
listed in Table 1. However, they remain an understudied group. Complete 
amino acid sequences are known for only two such inhibitors; three-dimen- 
sional structures for none. In spite of a few elegant studies there is no 
definitive proposal for the mechanism of action of any of the inhibitors listed 
in Table L 



.JUL 22 2003 n:01 flH FR CISTI ICIST 



P. 10/36 



600 LASKOWSKI & KATO 

Table! Some inhibitors of thiol, carboxyl, and metallo-proteinaies 



Enzyme inhibited 



Thiol proteinase 

Ficin, papain 

CathepsinBl.C 

Bromelain 

Papain, ficin, bromelain 
Papain 

Papain, ficin, bromelain 
Papain, catheprin Bl 

Papain, cathepxfo Bl 
Ficin, papain 

Human liver 
cathepsinBl 
Carboxyl proteina se 
Pepsin 

Cathepsin D, E 
Cathepsin D 
Pepsin 
Pepsin 

Metallo proteinas e 
CoUagenase (human) 



Collagenase (human gastric 
mucosal) 

Ca 2+ -dependent neutral 
proteinase 

Neutral metallo proteinase 
Carboxypeptidase A, B 
Carboxypeptidase A 
Aminopeptidase 



Source of inhibitor 



Avian egg white 
Avian egg white 
Pineapple stem 
Rat skin 

Rabbit skin 
Bauhinia seeds 
Leucocyte cell, spleen 
cell 

Bovine nasal cartilage 
<fc 2 Serum protein 
(human) 

Immunoglobulin G 



Ascaris lumbHoccides 

Ascaris lumbriocoidcs 
Potato 

Bauhinia seeds 
Scopolia /aponica 
cultured cell 

Cartilage, aorta (bovine) 
Bovine nasal cartilage 
Rabbit tissvtt in culture 

Porcine aorta smooth 
muscle cell in culture 

0| serum protein 

Rat liver 
Rabbit bone 
Potato 

Ascaris iumbriocoides 
N. crassa 



Molecular 



{ 



J 2,700 

5,600 
74,000 
13,400 
12^00 
24,000 
15,000 



13,000 
90,000 



15,500- 
17,500 

27,000 
24,000 
4,000- 
€.000 

11,000 
22,000 
27400- 
29400 



40,000 
30,000 

4,300 
, 7430 
[10.000 

5,000 



Ref 



53,54 
55,56 

57 

58 

59,60 
61 
62 

63 
64,65 



66 



67-69 



70 
71, 72 
61 
73 



74,75 
63 
76 



77 
78 

79 

80 
81-85 
67,86 
87,88 
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$nkibitomofi£erhievBroteiMues 

Xhe n^ber pf ^ of serine proteinases far ex- 

classes of prote^ It is not clear whether this relative abundance is a 
ii^e reflection of distribution of inhibitors in nature or only of the prefer- 
ences and convenience of the biochemists who isolate them, 

cqnaderabfc^ 
^own that these ^ 
J^^,(pro^ 

^aecsiidai^ato^ 

proteinases, stfae^s^^ 
shaie^eommon^^ 
substrates and are M 

It is not yet possible to decide whether several heavily studied inhibitors— 
the rather small stable thrombin inhibitor from leeches, hirudin (89), and 
the class of large, relatively unstable inhibitors from m»mmfK» n plasma — 
also act by the standard mechanism. These are examined briefly at the end 
of this review. 

The Standard Mechanism 

Inhibitors obeying this mechanism are highly specific, limited proteolysis 
substrates for their target enzymes. On the surface of each inhibitor mole- 
cule lies at least one (more in multiheaded inhibitors) peptide bond called 
the reactive site (90), which specifically interacts with the active site of the 
cognate enzyme. The value of k~JK~ for the hvHmiv«i« nf ♦»,«, ~**\a„ 

- - — / — >u — J -J - — - w» £rvj««HM< 

bond by the cognate enzyme at neutral pH is very high, 10M0 6 M~ l s -1 (91 , 
92), compared to a typical value for normal substrates of about 10 3 M -1 ' 
r l . However, for inhibitors, the values of and K n are both many orders 
of magnitude lower than the values for normal substrates. At typically used 
concentrations and neutral pH, therefore, their hydrolysis is extremely 
slow, and the system behaves as if it were a simple equilibrium between the 
enzyme and free inhibitor on the one hand and the complex on the other. 
The equilibrium constant for the association is extremely high (in the range 
of 10 7 -10 13 M-») (91, 2). An additional property of the inhibitory reactive 
sites is that their hydrolysis does not proceed to virtual completion. Instead, 
a neutral pH, the equilibrium constant between modified inhibitor (reactive 
site peptide bond hydrolyzed) and virgin inhibitor (reactive site peptide 
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f^l Jf^ * ^ (92 ' 93 ~ 98 )' Since *e same stable complex is 
formed between the enzyme and either modified or virgin inhibitor, both are 
Aermodynamically equally strong inhibitors of the cognate enzyme (2) 

mS£ § ° * e Mte of ^plex formation from 

modrifed inhibitor and the enzyme is much lower than from virgin inhibitor 
and the enzyme. In a few cases, this difference is so great that it led to the 
^sumption that die modified inhibitor was inactive, which longer moT 
tion showed to be an error. 

That the specifically hydrolyzed peptide bond in the inhibitor is the 
reacttve site is also shown by the following experiments: (a) Whfle L ever^ 

fomed carboxyl tominal residue (ft) always renders it inactive (99) tf) 
Specific jacylation of the newly formed aiiuno terminal reddue(POp^ents 

(c) KmeticaUy confrolled dissociation of the enzyme-inhibitor complex 
prepared with modified inhibitor leads to the enzyme and a mS of 
virgin (predominant) and modified (minor) inhibitor (101-103) 

oni? e th°rl meC ^ niSm * ** ^^bitor interaction ^'including 
I ^"^^tes whose existence was definitely shown, can bf 
wntten as (91. 99. 101. ifM 1 in i in. y """^ 0811 * 



, ™ <WA»l« 

wntten as (91, 99, 101, 103, 110, 111): 



where E is the enzyme, I and I* virgin and modified inhibitors, respectivelv 

i * • ( ^ c « stable enzyme-inhibitor Comdex Tt il 
applied to the analysis of this mechanism uniques are 

J? in ^f* to most mv «%ato« is the detailed nature of the 

sfcble complex C While a number of inferences can be made hTtht^ 

t^S^V"** the m^ft^ng 

absence of iSSSK (S^X ^^J 11 ^ prcSence * nd 
^e^ 



JUL 
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PROTEIN INHIBITORS OF PROTEINASES 603 

portion the fit is excellent and numerous van der Waals interactions, hydro- 
gen bonds, and salt bridges are formed. In each case complex formation 
occurs with relatively small conformational change; it is predcraunantly a 
lock and key interaction, although a small conformational adaptation of the 
inhibitor (presumably in the L -> C step) does take place. 

The detailed molecular events at the active site of the enzyme and at the 
reactive site of the inhibitor are not as dear as some facile statements imply. 
Here, only the highest resolution work of the group led by Huber, summa- 
rized in several reviews (121-123), is considered. The scissile peptide bond 
(Pi-Pi) of the inhibitor is intact in the complex. However, the carbonyl 
carbon of the reactive site peptide bond (P, residue) is not fully trigonal but 
is distorted about halfway toward tetrahedral. (The older view that it was 
fully tetrahedral disappeared with refinement) Since the carbon is 2.6 k 
(much longer than the 1.4 A covalent C-O bond, but shorter than a van 
der Waals contact) from the Or oxygen in the catalytic Ser 195 of the enzyme, 
it was believed that it was this oxygen that caused the tetrahedral distortion 
of the carbonyl carbon. This view was abandoned as it was realized that 
anhydrotrypsin (124) (where the catalytic Ser 195 residue is dehydrated to 
dehydroalanyl 195 ) forms stable inhibitor complexes of comparable strength 

(124) , with essentially the same geometry (119), and with tetrahedral distor- 
tion precisely the same as in a complex with trypsin (118), Therefore the 
partial tetrahedral distortion is currently believed to result from the attrac- 
tion between the carbonyl oxygen of the inhibitor and the oxyanion hole 

(125) of the enzyme, the NH's of Gly'» and Ser 1 ^ The carbonyl carbon 
is now poised for nucleophilic attack by oxygen 0* of Ser 195 , but in the 
stable complex the attack has not yet taken place. 

In recent studies 13 C was introduced into the carbonyl carbon of the 
"a^«te 0-carbon of the P, residue) of soybean trypsin inhibitor (Ku- 
ruiz, v ,2u, 127) aud of pancreatic trypsin inhibitor (Kunitz) (R. Richarz, 

Tschesche, and K. Wuthrich, personal communication). If the carbonyl 
carbon changed from trigonal in the free inhibitor to the completely cova- 
lent tetrahedral mtermediate in the enzyme inhibitor complex (the oldest 
of the three models) complex formation should be accompanied by a huge 
(-50 ppm) upfield chemical shift. In fact, a very small (-1 PP m) downfield 
chemical shift is observed by all three groups that did the experiment The 
complete tetrahedral intermediate is thus excluded. It is not yet clear 
whether these results are compatible with the X-ray results as currently 
interpreted. 3 

In our opinion the X-ray crystallographic studies do not explain as yet 
why inhibitors are inhibitors, not just excellent substrates. This is a complex 
problem, since strong inhibitors of bovine trypsin are simply very good 
substrates for a homologous trypsin 1 from the starfish DermasteriZ im- 
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bncata (92, 135). Human cationic trypsin is intennediate in behavior be- 

site of an mhibitor applies to many homologous enzymes. Some of these 
partition this value mio verv low k »n/ rr T 
Lu-.ui^j ^t. T y IOW *«it ana K m and thus are stronelv 

touted; others show more conventional k m and K m values andtte 
inhibitor serves them as a substrate. , m ^ tte 

THE NATURE OF THE REACTIVE srn? Th* y 

P^de^^S^^^^-^'iaUMtW-tl... two 
hydrolyzed by 




bpti 

STI 
SSI 

Hypothetical 
substrate for 
subtilisin 



-86 -28 
-46 -21 
-123 141 



-64 
-77 
-52 



152 
136 
134 



-J 16 
-89 
-92 



<-*S7) (172) (-S4) (J62) (-n 6) 



8? -135 166 -116 87 
8 * -117 169 -120 78 



(-62) (136) (-U2) W 0) 
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Table 3 Alternative amino acid residues In the sequences surrounding the reactive site 
(arrow) of inhibitors 



'0) 



Pancreatic Trypsin 
Inhibitor (Kunitz) Family 6 



Pancreatic Secretory Trypsin 
Inhibitor (Kazal) Family *> 



Streptomvces Subtil isin 

Aim iui cur rami iy~ 

Bowman-Birk Inhibitor Family* 



Soybean Trypsin Inhibitor 
(Kunitz) Family* 



P 4 P 3 h W P 2 P 3 P 4 P 5 P 6 P 7 P 8 



P 
L 
R 
N 



K 
R 
Y 
M 
L 



« . .G 
V 
A 
L 
0 
M 
F 



P 
N 
T 
A 
M 
I 
S 



* * «M 
A 

...A 

M 
V 
I 
I 
S 



P 
T 

T 
A 



R 
K 
E 
D 
A 
L 
S 
M 
V 
Q 
Y 

M 
K 



R 
A 
S 

y 

K 
F 
H 
N 
P 

y 

L 
P 
F 
H 
D 

Q 
M 



Y 

F 

N 
N 

I 

Q 
Y 



0 P V 



C. 



L... 
V 



P 
G 
A 



• P S Y R I R F J a E... 



Y. . 
F 
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Table 3 (Continued) 



\ P 3 P 2 p l 



Potato Inhibitor I Family* ,,.p v T L 

N 



Potato Inhibitor II Family* 



Alpha-l-Proteinase 9 
Inhibitor Family 



• A S Y K 



...A I P H 



P 1 P 2 P 3 p 4 p 5 P 6 P 7 P 8 



0 Y R C N R. 
F 



S V C E G E. 



T I P P E V. 



°'*^Apo/^^ from «• «• 

(egg wm«e; »*<*lckcri«,*xo«elna*lS^^ ™ d ovoinhibitor 

inhibitors of StreptonSL wSSJStiS ITm * * ' i P ° rCmC * mind > lasm * 

var. Dan Shaku Imo (179-181). from: Var Rws,M Burta «* <>". "8); 

'Inhibitor of a^tcina* inhioitor fa(nj|y ^ {mm ^ ^ ^ ^ 

S^^^^ feaCtiVe hydMl ^> loss of the 

f^dfcS I ^S?*' Stf0ng "talent bonding can substitute 

2? ? T ld ° f ^o 1 ^ Sulfide bridges, could still o^y the 

s^^sasr of m *— ■ *- » 

IleiractivesiteTOidue, P„ ^eraHycoiTOpondsiotliesDccilicltvof tt,. 
and trypsm-hke eiizymes (2), those with P, Tvr wl- y-ZT« • \ . V 
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very rough rules. In the Kazal secretory inhibitor family, inhibitors with 
P t Leu and Met are strong inhibitors of elastase (44); in the Bowman-Birk 
family elastase is inhibited with P x Ala, but not with P s Leu (37, 132). In 
many instances strong inhibitors of trypsin with P t Arg and Lys inhibit 
chymotrypsin on the same reactive site (37, 133, 134)* In other cases related 
inhibitors do not inhibit chymotrypsin, and failure to inhibit is not yet fully 
predictable. The most striking feature of Pi specificity is that exchange of 
Lys for Arg at this position, either by actual mutation or by semisynthetic 
replacement (104, 107), leaves the inhibitor specificity and strength approxi- 
mately the same. The exchange of Lys or Arg for a chymotrypsin specific 
residue generally changes the inhibitor from a good trypsin inhibitor to a 
good chymotrypsin inhibitor (105, 107), An odd exception is the rather 
strange behavior of Phe 63 soybean trypsin inhibitor (Kunitz) (105, 106). 

The lability to tolerate a synthetic or mutational re^ p, 
residue and stiU retain in^ 
specificity (eg^^ 
change in ? spec^ 
changes), is pecul^ 

proteim the replacement of a^ by closely related ones, 

ld&SWIlM 
lai^^bt^ 

most proteins and inhibitors dtf^ 

stt^tmn^ enough ex- 

amples are known, there is appreckble^ 

often even between orthoiogous prot^ related species [e.g, 

plasminostreptin (Lys) (136) and S-SI (Met) (137); bovine (Arg) (138) and 
porcine (139) (Lys) pancreatic The phe- 

nomenon takes its extreme form in third domains of avian ovomucoids (44), 

where the P. rPClHll* Jc tin* mncf cukuwt +/■% Av/%li*«4>%ri««-ir ~U****+~. ~£ _11 C£ 

residues in the domain — a phenomenon referred to as hypervariability of 
the reactive site. This case is only an extreme example of behavior observed 
in the evolution of all inhibitors. 

Although contrary to our earlier opinion, the P J residue in most inhibitor 
families can tolerate a broad range of residues, there are two exceptions: 
(a) It is generally agreed that serine proteinases seldom hydrolyze bonds 
with Pj Pro. Apparently, inhibitors with P| Pro are not active. Silver 
pheasant ovomucoid with . - . Cys Asn Lys I Ala . ♦ . reactive site sequence 
inhibits trypsin; highly homologous golden pheasant ovomucoid with . . , 
Cys Asn Lys 4> Pro . . . does not (W, J. Kohr and M, Laskowski, Jr., 
unpublished). Except for this case Pj Pro is not observed (b) There is 
stubborn conservation of P[ Ser in all known Bowman-Birk inhibitors. 
Semisynthetic substitution study (109) showed that Ser in this position was 



i 
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better than any other amino acid residue whose substitution was attend 
^e^pnsin^as^vanantofp^ 

£^"'2™ Ly» f Phe ... reactive site (140)Tamn^ weaker 

Generahza^ about otb^posn^ 
•eqwroes oisome inactive molecules dearly homologous to active inhiki. 

tained without Pro (sec Tables 2 audi) ^ ^ can be ob- 

, . Semisynthetic modifications also led us to the conclusion an* im.u . 
J soybean trypsin inhibitor (Kunitz) iw^52^^2Li 
P leads to inactivanon of the inMM^Z!. u. t ^ mo *** resuiue 
of the P,-p,' "JZ^I . T 1 ™^ P««nnably because reformation 

?ona!^duc^LT^ Simaarly ' ^ 

thepepSSn^^l 1 ^ ^ ^ough all 

M^M.E REACTIVE SlIES ON A SINGLE POLYPEPTIDE CHAIN 

«». are quite common. In the comn«ritiv« 

bte. tot years ofoarcbinghave yiSnL^S! T" |,W 

inters of know, J2 JS^!? 1 ','' *?* 

molecule, and is most cZK^ • ^ on »"* 

lent association of ^veSSd^T ^ m PJ otemch «^bynoncova- 

headedness « ^ **" ^ ^ 

y ° r at least two inhibitors. The subtilisin 
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inhibitor, S-SI, is a strong noncovalent dimer (144) that inhibits two mole- 
cules of subtUisin without dissociation. Similarly, potato inhibitor I is a 
tetramer which inhibits four molecules of chymotrypsin (145). While sev- 
eral other inhibitors associate (most notably Bowman-Birk inhibitors), they 
apparently dissociate during complex formation. Multiheadedness also re- 
sults from the presence of two separate chains linked by one or more 
disulfide bridges. Turtle egg white inhibitor testudin [a Kazal inhibitor (I. 
Kato and M. Laskowski, Jr., unpublished results)] and fi x bungarotoxin 
(146) (a Kunitz PIT type inactive inhftitor) are partial examples of such 
inhibitors. 

The most common method of achieving multiheadedness is by gene 
duplication. The first such case discovered was lima bean inhibitor (147), 
a member of the Bowman-Birk inhibitor family. Inhibitors of this family 
consist of two homology regions, each with one reactive site in a homolo- 
gous position. Even more striking examples are found in the Kunitz and 
Kazal families of inhibitors, In the Kunitz family, the COOH terminus of 
the inter-a-trypsin inhibitor consists of two tandem domains connected 
only by a short connecting peptide. Dog submandibular inhibitor consists 
of two, all avian ovomucoids of three, and avian ovoinhibitors (also inhibi- 
tors from avian serum) of six tandem Kazal type domains. There is weaker 
evidence for several other inhibitors with multiple tandem homologous 
domains. In most cases, domains can be separated by cleavage of a single 
peptide bond in the connecting peptide and are independently active. 

The tendency to form multiheaded inhibitors by gene elongation through 
partial or complete duplication (often repeated several times) is a general 
tendency of protein prolinase inhibitors and stands in reasonably sharp 
contrast with most biologically active proteins where a single active site in 
a single polypeptide chain is most common. The minimal chemical require- 
ment forsuch behavior is the dispensability of the NH 3 + terminal group and 
of the CxjQr terminus for activity. However, the biological driving force is 
in many cases, obscure. It is clear for the plasma inhibitors—the inter- 
o-trypsin inhibitor (158, 159) in mammalian plasma and the six Kazal 
domain inhibitor (39) in chicken plasma— that formation of multiple do- 
mains raises the molecular weight and prevents excretion and yet does not 
require the use of a large number of amino acids per reactive site. In feet, 
it is i surprising that several mammalian plasma inhibitors (e.g. a t proteinase 
inhibitor, ttl antichymotrypsin) do not have multiple reactive sites. No 
similar rationalizations are available for the Bowman-Birk inhibitors for 
ovomucoids, and for dog submandibular inhibitor where the physiological 
functions are unknown. 

Gene elongation by duplication is not the only way in which inhibitors 
become multiheaded. Red sea turtle inhibitor, chelonianin (Kunitz type), 
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(149) consists of two domains belonging to two different inhibitor families. 
Thus, elongation by gene fusion is another possible mechanism. The extent 
to which this mechanism is used is not yet known. 

The Inhibitor Families 

The number of known and partially characterized inhibitors of serine 
proteinases is enormous. They can be classified into at least 10 and probably 
^families. To conserve space we adopt the conservative view and d 
scribe oriy families for which at least one sequence has been unambiguously 
aetermmed, where the reactive site has been cleanly assigned, and where 

hst of such families is presented in Table 4. 
^i^ihemajorcnt^ 

fiundy (except possibly potato II mdAscaris trypsin InhiWto^pSSns 

ofaUm^dis^debridgesarecomple^ 

(or, mhibitor homologues) have additional interchain disulfide feT 

testudm and ft bungarotoxin). onages W 

THE PANOPTIC TRYPSIN INHIBITOR (KUNITZ) FAMILY Thisfam 
Uy is named after the first inhibitor to be isolated in ij^ta flt? 

^ennmed (184 185), the first for which reversibility was d^oWed 
n ^'* e first ^ obe ^ced (148), and the first whose threeSSS 
structure was determined crystaHographicallv both for ZftZlT^* 
and for the enzyme-inMbitorwrniplex^ 1 ^iXXt^tf^Z 

~ ™ v „ wmiy acio residues), soluble, stable, and its sequence 

F ?!!!! S0 / protc »^ 1 to«thati n hftitserinepro. 
temases and obey the s tandard mechanism 

I. Bovine pancreatic trypsin inhibitor (Kunite) family 
m Pancreatic secretory trypsin inhibitor (Kazal) family 
ill. Streptomyces subtflisin inhibitor family 
*PfeM«*WM^unite),iam% 

Vl" tZ^^SX^ fahib1 ^ ^^i*) family 
vi. Potato I inhibitor family 

VII. Potato II inhibitor family 8 

VIII. Ascoris trypsin inhibitor family 8 
IX. Other families 
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1 H=n \ 



(c) 



(e) 



n 1 rRI 



(f) 



(g) 



(h) 



rFfn I nrKi 1 1 dm 



Hi cFTTlr 



ZD 



(k) 



r7~i i — i- 



; Topological structures of proteinase inhibitors. Solid circle and arrow indicate, 
respectively, a half cystine residue and a reactive site. Inhibitors of the bovine pancreatic 
trypsin inhibitor (Kunitt) family obtained from: a, bovine pancreas, Kunite, (148); b> red sea 
turtle egg white, chelonianfai (149); c human totcr-a -trypsin inhibitor (158, 15% d. B-ehain 
of 0t bungarotoxin (146), 

Inhibitors of pancreatic secretory trypsin inhibitor (Kazal) family obtained from: e, bovine 
pancreas (138*/ canine submandibular glands (164, 16S);£ red sea turtle egg white, testudin 
(Kato, L, Laskowski, M„ Jr., unpublished results); h, avian egg white, ovomucoid (44)- i 
avian egg white ovoinhibitar (39), 

Members of other inhibitor families obtained from: j, Streptomyces albogriseolus (137); k, 
soybean, Kunitz (167); i, lima bean, Bowman-Bide (168, 169); m, potato. I (1T7 17&Y „_ 
potato II (179-181). * 
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and threwiimensional structure are known, it became a favorite for investi- 
gations by physical protein chemists, rivalling serum albumin and ribonu- 
clease. It has been the subject of extensive studies on protein folding (188) 
(time course of formation of disulfide bridges), of extensive »H and ,3 C 
nuclear magnetic resonance studies (189-191), and it is one of the favorite 
proteins for testing algorithms relating sequence to three-dimensional struc- 
ture (192-198). It is worth pointing out that many other protein proteinase 
inhibitors share all the desirable properties of pancreatic trypsin inhibitor 
(Kumtz). In spite of extensive studies, the physiological function of this 
inhibitor is not known. It is present in virtually all bovine organs (38) (liver 
lungs, parotid glands, spleen), but appears to be restricted only to bovids' 
and caprids (199). 

However, both cows and pigs have a homologous inhibitor, the colostrum 
trypan inhibitor (200, 201). The bovine colostrum inhibitor was sequenced 
(1 50); it is a glycoprotein with a glycosylated Asn residue (202). Its physio- 
logical function is controversial; one suggestion is that it protects colostrum 
antibodies against tryptic digestion in the newborn (203). Evidence both 
tor and against this suggestion has been adduced (204-206) 

The tota-a-trypsin inhibitor in mammalian serum is a huge molecule 

it a uuUH-tenmnal fragment of mol wt 14,000 (207). This fragment was 
sequenced from both bovine and human mter-a-trypsin inhibtoOH 
m LB .consists of two fendem domains, each homolo^s to bcZepal 
SSSi3T "Aibitor (Kunitz), connected by a short connecting pepSS. 

t^T ultimate domain of mter-a-trypsSiib- 

itor) has Met (human) or Uu (bovine) as a P, residue. SurprisinSTit does 

uJbrnate domain in mter-^trypsin inhibitor) has Arg as P, in both human 
and bovine material. It inhibit* trvmin tu- .JL 

^Htern^aldoniamsofmter:^ 

s^nSrlr^^^i^ FnigmentationofinS^ 
Srf^T ° CCUrS . m ™°- ^ f «g«ents in serum are identified as 
aad-staMe serum proteinase inhibitor*. Because of their low molecule 

i^S^T ( } - SmCe many P«>teittase inhibitors consist of several 
^dependent domams connected by connecting peptides, these findmt so 

tTTi' SUg86St ""V ^bitorVwill turn out tot ft£ 

ments of krger ones. The presence of bovine pancreatic trypsin inbS 
tfunitz), bovine colostrum inhibitor, and at leasUwo KumSC TmZ 
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tfXtTt^^^^ (torins 1 B ' »* E) were isolated, 

. (1 ?t 15 5 1116 Stnkin « P' 0 **** * **» « that toxin K (P. Lys) 
and tan! I CP, Tyr) do not inhibit trypsin and chymofcypsin, redely 
but toon E <P, Lys) is a strong inhibitor of tryp^TSS St™ 

s«ak«<152, 153).& Bungarotoxin (146) consists of two polypeptide chains 
held together by an interchain disulfide bridge. Chain Aofl20 rSduTk 
homotogousto phospholipase A, but chain Bis homologous to&Su ^ 
mhibitors However, chain B differs from the other member* oi 
group m two important ways. The residue corresp<*ci^ 

p^umably responsible for the interchain disulfide. The fil bungarotoxht 
«™cto* shows that Kunitz type inhibitor sequences may be cSSS 
as^tod with protem sequences other than multiple KunTd^T 

intiu " ^^ 6Xample rf ^^on is shown by the Ku^vne 

mhtator ftom red sea turtle egg white, chelonianin (149) 

inhibits trypsin (presumably in its first domain where Pi Lw)andsubri 

is closely homologous to Kunitz type inhibitors, while the second domain 
is not homologous to any of the inhibitor families described Ifar St 

PANCREATIC SECRETORY TRYPSIN INHIBITOR (KAZAL, FAMirv 

munication), hurna^ ^and torWi^ ^ trasser » 1*^1 corn- 
inhibitors are s3 in Sm^S, i~ } ^'^^sequenccd The 
in the pancreati juiJ tW T*? ^ 



614 LASKOWSKI & KATO 



mWbrt enterokma* (if they did, trypsinogen could not be activated in the 
doodenum) and other pancreatic serine proteinases of its own snede^the 
inlnbtor would then be diverted from £ essential t^Sffi^ 

Aferge munber of low-molecular- weight inhibitors were isolated from Se 

o?Z. 1^ aad Seminal vesides of variou * M«t 
1^!, ff erred toaSaCrosin since acroTHw* SZ 

S ^^^^osomesaiS, 216). However, theph^S 
function of these inhibitors is far from clear o,Ji, ; ux:. 
seo^enoed,- one from porcine, ^mTpW J"" 

two tenrt^m irt,„i . . ^ umus V 1M » 214 ;- This inhibitor consists of 

headed & turfcm, n/invi, ^ ^ v 6 * cnicicenj, some double- 

f , . _ wiam > wu eiastase), and yet others ^ 

mains, in which p * Af 6 ^ ^ ^yfc Kazal-type do- 

iydrolwwL and „-„,,„,,": 5 °r ra >™ ™M domains can be readfly 

tached to four fin W^^T^. ^carbohydrate in ovomucoid is at- 

^Jilz^^sJZ^^r fiv f Cm « a » 

the third domains is mMfbZZT * sW X lation of the site io 
PWUaJ ' tterrfbre, after enzymatic hydrolysis of the 
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PROTEIN INHIBITORS OF PROTEINASES 615 

bectm^l^^ JTT, r" 1 ^ * «■* oviduct, ovomucoid, 
beeame objects of considerable interest to students of km bbt^T 
P™«n synthesis, (221 , ^ Chicken ovom^S a 

Aside from ovomucoid, avian egg whites contain another inhihW «r 
serine proteinases, ovoinhibitor (4H Thi* ,v mhlbltOT of 

smaller and more variX^ouW m ? Uch 

(226). It has been extensively studied f227-2ra • , ov0mucoid 

homologous, Kazal type domains (39 451 Th* p ™7 ft . dem> 
^ ^ tiypsuHype reactive sites are followed by three cnvmotrvS 

identical to that of chicken ovoinhibitor and with u ,7 T™ 01 * 
^ properdes wa, isolated from ^"(^£2^ 
ally indistinguishable from ovoinhibitor (40) and had thZT™ ° & ~ 
sequence for at feact +t» «_* w V nad ™» same ammo acid 

J—.h post-translartal bAS&uSZSXIT-"^ 

function of ovoiidiihit™, i. < „ i"™™* ™ "* pracipal 

avian plasma (a pnySo^ ^vX^™ fTT 
ovomucoid, andonp^ S^ST domawM « * ovoinhibitor, three in 

been described. TlJZo^hT^f^^^^^^^y 
taiibito,, calle^deUte Sf ^ r''^ aWKrwa 
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STREPTOMYCES SUBTILISIN INHIBITOR FAMILY Various species of 
Streptomyces excrete into the medium protein proteinase inhibitors, with 
different specificities dependent upon species. One, called S-SI, from Strep- 
tomyces albogriseolus, inhibits subtilisin very strongly and chymotrypsin 
and trypsin very weakly (235, 236). It was prepared in high purity and large 
amount and was chosen as an object for collaborative study by several 
Japanese research teams. The amino acid sequence was obtained (137), the 
reactive she assigned, the three-dimensional structure was determined for 
the free inhibitor (237) and is in progress for the subtflisin-inhibitor complex 
(1 14, 238). A large number of physicochemical studies on the free inhibitor 
and on complex formation were made or are in progress (239-242). Strep- 
tomyces antiplasminolyticus excretes a potent plasmin inhibitor plasminos- 
treptin. This inhibitor was also extensively characterized (136, 243, 244). 
The two inhibitors are strongly (70%) homologous. They consist of 1 13 and 
109 residues respectively, each with two intrachain disulfide bridges. The 
reactive site is located in the second of these. S-SI forms strong noncovalent 
dimers and dimers inhibit two enzyme molecules without dissociation. 

A striking observation is the pronounced sequence similarity in the reac- 
tive site region between S-SI and plasminostreptin on one hand and the 
Kazal family of inhibitors. In the region where it holds this similarity is 
sufficient to be called a homology; however, the remainder of the molecules 
appear unrelated. We are unable to decide whether the S-SI family and the 
Kazal family are products of divergent or of convergent evolution, and 
therefore, whether S-SI family should be a subgroup of the Kazal family or 
a separate family. In either case, the strong sequence similarity should make 
the comparison between them particularly useful in stnicture-to-function 
studies. 

More Streptomyces inhibitors are being isolated and studied. They may 
prove to be as eond or hptt**r than 

♦ v - - - — — — - ' wiuuiuww uuxu domains as a 

demonstration of hypervariability of the reactive site between closely re- 
lated species. 

BOWMAN-BIRK INHIBITOR family These inhibitors are readily iso- 
tated from the seeds of all leguminous plants. A large number of them were 

i?^?. 68 ?? (H) ' a considerable number were sequenced (168- 
176). All of the sequenced ones consist of two tandem homology regions on 
the same polypeptide chain, each with a reactive site (245) (Table 3} 
However, a significant difference exists between the Bowman-Birk inhibi" 
E^ "52£ re ? ons «• ^ by interhomology region disulfide 

'^^^ bMkn fr0m Kunit * P^tic trypsin! 
mhibitor related family and Kazal secretory inhibitor family, wherettere 
are no mterhomology region [inhibitor family, where there are no inter- 
nomology region (interdomain) disulfide bonds] (43). In spite of the pres- 
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ence of interhomology region disulfides, Deenaka and co-workers split the 
two homology regions by making two specific peptide bond cleavages 
(CNBr and a pepsin-catalyzed cut) (247, 248). Both homology regions are 
separately active, but are weaker inhibitors than the parent molecule, pre- 
sumably due to partial loss of rigidity. Synthetic decapeptides correspond- 
ing to the heterodetic cyclopeptide containing the reactive site were made 
(249). These are even more weakly inhibitory than the separated homology 
regions, but, nonetheless, they appear to serve as promising models for the 
study of the role of individual amino acid residues in inhibition. 

In most Bowman-Birk type inhibitors the P, residue in the first (NH r 
terminal) homology region is Lys, and trypsin is inhibited, while the P, 
residue in the second homology region is Leu, and chymotrypsin is inhib- 
ited. However, these residues vary a good deaL Particularly interesting is 
the soybean inhibitor D-H, in which both P, residues are Arg and which 
is double headed for trypsin. This inhibitor can be viewed as an ancestor 
of other inhibitors (171). Soybean inhibitor OH (172) and garden bean 
inhibitor n (173), both of which have Ala and Arg as their Pj residues, 
inhibit both elastase and trypsin. 

Many homologous Bowman-Birk inhibitors are isolated from each spe- 
cies of beans or even from a single bean (171). Some of these forms appear 
to differ only in the length of their NJH r terminal sequence, but many 
represent significant differences in sequence. Their tendency to form homo- 
or hetero- dimers and trimers complicates the estimate of the number of 
forms present. However, Ikenaka et al isolated and sequenced three truly 
different inhibitors of this type from soybeans (171, 172). 

sjOYBEAk i*^ The first plant in- 

hibitor to be well characterized was soybean tryplm mlubitor (Kunitz). Its 

isolation and crvst aHiya rinn inA ♦*,«,«• U« ~- i . . 

. , " — — - w «" bvuij/ic& wua trypsin oy m, 

Kunitz is one of the classic achievements of inhibitor chemistry (1 85 250) 
The sequence (167, 251, 252) and three-dimensional structure of the porcine 
trypsm-mhibitor complex were determined (1 12). This inhibitor served as 
the main substrate for work (largely in this laboratory) that established the 
standard mechanism and permitted development of semisynthetic reactive 
site replacement technique (104, 108). It is, therefore, surprising that this 
inhibitor stands rather alone-there are no reports of homologous inhibi- 
. comm0n leguminous plants. However, a strikingly 

similar inhibitor from winged bean (253) was recently described, and inhiW- 
ors from nee (254) and from barley (255, 256) are probably homologous 
to soybean trypsin inhibitor (Kunitz). 

Soybean trypsin inhibitor (Kunitz) is often confused with Bowman-Birk 
TTT' ^ ?* tW ° The polypeptide chain in 

single-headed Kunitz soybean inhibitor has 181 residueVand only two 
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disulfide bridges (see Figure 1, k) while Bowman-Birk is typically double 
headed, has 7 disulfides, and only about 70 residues (Figure 1, /)• 

OTHER INHIBITOR FAMILIES We recently proposed (45) a list of inhibi- 
tor families in which we assigned the trypsin inhibitors) and chymotrypsin 
inhibitors) from Ascaris to two new separate families. It is now likely, but 
not certain, that one new family will suffice. Ascaris lumbricoides is a large 
parasitic worm, one type of which infests man and another pigs. These 
worms contain inhibitors for all of the digestive enzymes of the host (67). 
Pepsin and carboxypeptidase inhibitors from this source are listed in Table 
1. One or more inhibitors of trypsin and several related inhibitors of chymo- 
trypsin have been reported. The trypsin inhibitor isolated from Ascaris that 
infest pigs is an inefficient inhibitor of human trypsin, thus raising the 
possibility that this property may account for the inability of this species 
to infest humans. The sequence of this inhibitor has long been known (257), 
but the position of its reactive site was only recently assigned (R. J Pea- 
nasky, W. J. Kohr, and M. Laskowski, Jr., unpublished results); in the 
process, the sequence was revised. . 

The chymotrypsin inhibitors of Ascaris appear to inhibit chymotrypsin, 
elastase, and subtilisin at the same reactive site (67). This behavior is similar 
to that of Kazal-type inhibitors (e.g. turkey ovomucoid third domain), but 
is in contrast to that of chymotryptic sites of Bowman-Birk type inhibitors, 
where only chymotrypsin is inhibited. Recently, both the preliminary se- 
quence (258) of several Ascaris chymotrypsin inhibitors and the location of 
the reactive site (259) were announced. 

Another extremely rich source of proteinase inhibitors are potatoes and 
related plants (9, 1 1). Many inhibitors, including carboxypeptidase inhibi- 
tor (Table 1) and several families of serine proteinase inhibitors, are present. 

JThe best imrforstrwl on> ttio ~~*„*„ t i_i_'t !. . • . 

. . ~ — ;~- *~«»w t. uuuviwis, wmcn are noncovalent 

!*? m ^ f f Smgle * chain inhibitor only a angle intrachain disulfide 
bridge (260, 261). Far more complex is the potato II family, where arnino 
acid sequence shave been reported fora 'low-molecvn^-weightchymotryp. 
sin inhibitor (262), for an inhibitor from eggplant (263), and for inhibitory 
/ITS °i ?° Iarger P rotemasc inhibitors, Ila (264, 265) and lib 
(180, 266> When all of these are aligned, homology is apparent. However, 
in contrast to other inhibitors, reactive sites (267, 268) do not align and 
there are many postulated deletions. At the present stage, therefore, the 
designation of a potato II family may be premature. 

At least 100 small inhibitors not reported in this review have been de- 
scribed for serine proteinases and in some cases rather intensively studied 
P^P^ t0 P hce into families, but it is clear that new 
families will have to be created for some of them. The benefits in obtain^ 
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enough information to allow for a family assignment will be quite large, 
since comparison of inhibitory specificities or of various thermodynamic 
and kinetic parameters of enzyme-inhibitor interaction is far more valid 
within a family, 

INHIBITORS IN MAMMALIAN BLOOD PLASMA Many proteolytic pro- 
cesses (e.g. blood coagulation, blood clot dissolution, formation and de- 
struction of peptide hormones) occur in blood plasma and require exquisite 
control. It is not surprising, therefore, that plasma contains many kinds of 
proteinase inhibitors— some in very high concentration. The principal ones 
in human plasma are listed below. 

Of the inhibitors listed in Table 5, 02 macroglobulin is a general endopep- 
tidase trap and its postulated mechanism of action has already been dis- 
cussed. Inter-a-trypsin inhibitor chain terminates in two domains belonging 
to the pancreatic trypsin inhibitor (Kunitz) family, the presumed reactive 
site lies in the COOH terminal domain. The other inhibitors listed in Table 
5 do not appear to belong to any of the families discussed thus far. Further- 
more, the sequence is not completely known for any of them and then- 
mechanism of interaction with serine proteiiiase--in spite of a large body 
of literature dealing with this subject— is either unknown or highly contro- 
versial. This comparative lack of information results from the compara- 
tively large size (required of all plasma proteins) and high lability of these 
inhibitors, which complicate isolation and experiments that are readily done 
with small stable inhibitors. As a result we resist detailed review and offer 
only thumbnail sketches without thorough analysis. 

By far the most studied is a r proteinase inhibitor (often called 0l anti- 
trypsin-* bad name) because its genetic deficiency strongly predisposes 
individuals who are homozygous for its Z allele Cm contrast to the normal 
M) to emphysema (270) and to liver disease. The Z inhibitor is fullv active 
out it is present in serum at 20% of the normal concentration.' Partial 
sequencing shows that Z differs from M by the substitution of Lys for Glu 
in one of the internal sequence positions (271-273). In contrast to small 
mmbitors discussed earlier, a r proteinase inhibitor contains no mtrachain 
disulfide bridges (131). a,-Proteinase inhibitor inhibits an exceptionally 
broad spectrum of serine proteinases (e.g. trypsin, chymotrypsin, and elas- 
tase). Acomplex with one of these enzymes does not inhibit any of the 
others. This observation may indicate that all of the enzymes act at the same 
reactive site; a number of alternative explanations are also possible. If a 
single reactive site is accepted then good, but not compelling, evidence (1 82 
274) was presented that the site is a .. . Met-Ser ... bond in the NH 2 - 
termmal portion of the molecule (the sequence surrounding the reactive site 
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620 LASKOWSKI & KATO 

Tables Principal proteinase inhibitors in human blood plasma (269) 



Name 



Concentration 
mg/lOOml 



^•proteinase inhibitor 
«j an tichymo trypsin 
inter-**- trypsin inhibitor 
<*2 antiplasmin 
antithrombin III 
Cj-inactivator 
o^-niacroglobulin 



290t45 
49 * 7 
50 
7-1 
24 ±2 
24 ±3 
260 * 70 



Moiwt 



52,000 
69,000 

160,000 
70,000 
65,000 
70,000 

720,000 



Number of 
polypeptide 
chains 



^Maximal number of enzyme molecules simultaneously inhibited by 



Heads 2 

1 
1 
2 
1 
1 
1 

1-2 or more 



cule. 



one inhibitor mole* 



The most recently Ascovered (275-277) of the major plasma mhftritois 
™JT? ^ »«* ^owly with several trypsin-like T- 

zymes but its reaction with plasmin is exceptionally fast However its me 
of associadon is reduced (278) to more typical values 

f S S,tC ° r by addition of mod ^ »ow conceptions 
*S£Z ■ h JSS?^"^ ^ P^ 02 »tiP^-mhibitS 
US? ? 3^? by SpCClfic kteraction between the mhibitor and the 
strong lysine binding site on the heavy chain of plasmin 078) 

thrtb^^ 

^rombin-^thrombin reaction is moderately slow in absence added 
heparin; addition of heparin greatly accelerates the , cS2 

consensus appears to be that it is the antithrombm-hepa^compS 

ucpwin complex is rormed, most of the henarin 

«- to combine «k additional V^wSTE 
7"*f k "*» C82, 283). An essentially CfflnL^aS acM 
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^gonal,&^af 13 A)!^^g^t^^^|nd" « 
have been refined to owtolt^ifclt » i " ^ iesolu wo% They 
1*8%, respeSy. TtJ^S£<?'£ U * U 8 ; 9% ' 21.6% anil 

W the multiple iSrpJjSS^ ™S^ S ° lut ?" of 2 6 A 
the first Wgh^uKo^ ^^^L "f* 0 ^ ^ study provides 
proteinase S wtth^™^ a 5F ta b ? twcen • Kunite-type 

except the rSveTte "^^^1^ 
induced upon co^lexatioiT TftaS^cit ^ .7%^^ 
canonical conformation similar tafW^T a u P of 511 ad °Pte a 

retains a ^IJ^L^X^T^^^^^' and *« 
complex between a T^TniSST f? 50 "*^ Modeling studies on the 
W S ^ inhibitor D&3 

xeveal a^new fte S Z&t£^ P 1 ^ 0 ^ activator 

crucial role in thefbin^g ^ >«*«ctans which could play a 

© 1998 Academic 



^S^JSl^' bwiM P a ^& trypsin,- 
•»ril» bovine pancreatic trypsin inhibitor- En 
mhttor DE-3 from ErjS ^Jffi ™$P™ 
mhib tor a from &y HW*, varkgala; BTIb, t3n7 

MCllhypsa, inhibitor from bitter gourd Jefc ' 
«¥iac«nenb ortho, orthorhomV^ o^^; 

te^gonal crystal- tPA, tW^ p] dfi ^o^ 
activator; WCX Kwufcc-tvw* ch^E^T?^, 



Introduction 



Sf-J^ ^tz-type soybean trypsin inhibitor (SIT) 
^^jdoystalH^d as early a.fr ito (EST 

!®&J* 181 «*» acid Udi^hS 
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Structure of Kuniu-rype Soybean Trypsin Inhibitor 



the cDNA clones encoding the precursors for TP 
and Ti 6 was reported (Song el d., 1991) 

The mechanism of inhibition of PPT by SIT was 
proposed (Blow et O., 1974; Baillargeon id., 1980). 
A stable complex between the virgin inhibitor and 

&^oT 0 < ^J BRned " nd ******* very 
slowly fcito the free enzyme and a mixture of 

^odified and modifled inhibitors. The modified 
^ * *J*« fica % cloved at the sdorfte peptide 
bond between Arg63 (Pi residue) and Deo4 ^' 
£»due). ta the refined structuies of^rnpS 
between bovine pancreatic trypsin inhibitor (BPTT) 
arrfbc^ pancreatic trypsin (BPT), the scissOe 
peptWe bond [ remams nearly intact, with a slight 
^ i e f° nnatiDn of the carbonyl oZ 
atom ^observed (Bode & Huber, 1992). E 
ever, ihe absence of a large upfield shift in the «C 
1**", [of Sn upon complex formation 
** * e STLPPT complex b™ 
cova tent fully tetrahedral adduclZuargeon < 

^Stf^SH' no ou^f-plane distortion' 
around I the inhibitor's sdssile peptide bond was 
observed in fte refined struct^ theXj^ 

mhfcitor from the Squash family (HuangTot, 

gS^ 1 ^ n^n; wheai- at 

S ^ ^*^ P Inhibitor (MareS 
SSSi^^iS^'^'* Murdy, ? 1989) 

to ^^fecrq«totryr^ 
PJ^srnjr^gen.achvator (tPA)ptPA is currentiv brtiZ 

to group c and are of particular 
to inhibit tPA (Heussen et al. 1984) rm aT 
sample, provides an effective IffiS^JS for 
the one-step purification of bom onV a^TLT 
<^,forms oftPA (Heussen et a., im) 
The cr)«tal struchw of *emfwi4mi«, . 



A**-? hW,or couW be bu «t' since the 
™ ^ r rc «ta*»*7 incomplete (56%) 
m ^. 3 :°./° \« A range and a significant part of 
Stfl^^T* ^ P~^«t«m density 
22^,2?™ ha t sbeenno "Tort of a highly 
refined structure of any Kunite type p«otemase 
gtartar in complex with ite JepS^ 
Rn^rmore fte structure of STTin tr/fa* aa* 
has not vet been reported, precluding a detafled 
«ttgs of structural dumgesWia^S Jaf££ 
plexanon with a proteinase. 

J^i efore ' *a« exists a need for high-resolution 
cr^taltographic analyses of Sn, in cJnpfcx wifo a 

Provide a high-resolution picture of Sn and also to 
gafa msight into the intentions betweenVStS 
type proteinase inhibitor with a serirTprotEe 

inhibitOT.aJor«:at ^ 

*^at l;9^ The structure'of 'SmmwS, 

Results and Discussion 
Quality of refined models 



Table 1 summarizes the refinement statistics as 
Mi trOFPT complex m two crystal forms and free 
PFT S ^SS^ 6 ™"?* fa paper, residuesof 

Missing from the inhibitor part of STLFPTtcrth^ 
to 1811 5 f"? C-fennhial residues 078 

WD «e stm r^f^^^ u ^» *? 
mcjiei are two t<-i*(31fo%t Jg 

the electron density for the siifediaira tfH. u 

154, 1(55 ^ 144 ' 

Gtal38, in the free Ml So<Stm ^ T du 5 
^on of ^chandTplot ^^Z% 
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Table L Refinement statistics 



Resolution range (A) 

No. of reflections (F>2o) 

Crystanographic K-factor (%) 

Free K-factar 

No. of residues 

Noof non-hydrogen atoms 

Protein 

Water 

Average B-iatim (A 2 ) 
Mam-chain 

Wafer 

or- 

Rjri&. deviations frpm ideal 
geometry 

Bond lengths (A) 
Bond angles fl 
fomachardfan outliers 



^nJW(ortho) 



ao-1.75 

3UJ3S 
18.9 
21,4 

171 (5TI), 223 (ITT) 

1268 (STt), 1630 (FPT) 
142 
1(PPT) 

34.4 (S11X 19.6 (EP1) 

372 
19*6 (FPT) 



STlPFTXtetro) 



O013 
1.32 
0 



* The free R-fcuxor was calculated using 10% of the dafT" 



8.0-1,90 
23,967 

21.6 

28.0 

172 tfH), 223 (PPI) 

1286 (STQ, 1630 (PPI} 
139 

as^csnufciofrpi) 

34.6 (STI), 20$ (PPT) 
34J2 
33.6 (PPT) 



0.011 
1.68 
0 



FrecStt 

8.0-230 
9079 
19.8 
26.9 
172 

1^81 
44 



37.1 
394 
36.0 



0.012 
1.40 



lt * 1101 weU defo * d ^ its * angle lies 
sbghtty outside the allowed regzaa M P 5 

f degree of confidence but onlv a 0» fnrinJ 

tamU* for ^segments M I and uoS 
^t'.J^ a " other stretch of density ^ ^ 
lively assigned to residues 116 to 12l In therZ. 

107 and 115 has been unambiguously traced. 

SIT a"*" ? r 5 e ™ in ^ am atoms of 
»Ji in both the complexed and the fW eta** *~ 

JfiTrar •*? ^ " * ^ 

• u J h "Acates a greater decree n t 

J«or for the njairnchab, atoms of each residue J 
SH models .s plotted as a function of residue Z£ 

models and also the N terminus for the taTinWw 
^ regions with high B-fecto^rr^^ 
loops connecting Mrands. The etotoST h2JS2! 

5£ ^ "^P? with ^ B-factoreare weak'b^t 
o^lf^ffi^^aS^ 

Asnl3 shows B-fectors S J^^f aiound 
Jjg- ?)• This is duftote^eX^fZ 
residue in the network of hydrogen bonds which 

We final Of.-FJ electron density calculatS 
™& for the Sffi 

rfStt in the STLPPT complex in the oiaShS 
ttystal is very well defined, reflecting the tow B 
factors around this region (Figure 2) 




I 




Residue number (ST1) 



«» <» M> t» l90 



Overall structure of STI 

Sn is roughly spherically shaped with arwnvj 
mate dimensions of 45 A 5 42 AT40 X fRr" 
structure consists of twelve -ntfp^fJM? 
^1? toops connecting these aKf ' 

«escnoe<i tor the structures of En (Onesti 



£ Orthorhombic crystaJ, ft) £ £lS^ r f°7 ,ple * 
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Structure of Kunitz^ Soybean Trypsin inhibitor 



mm 




VJ* 



15.0 to 1.75 A data and contoured at 1.2 o. 



^d" (McLachlan, 1979; Murzin et */ 
1992). has also been observed In the structures of 
nonhomologous proteins Including interieukhvla 
STSL* ?" 1990 >' "toieukln-ip (PriesUeTd 

J™ fib ™^ st growth factor, (Eriksson a V 
1991; Zhang < 1991; Zhu < 1991) 

3-fold internal symmetry, with the synunettv «fa 

is a fouMtranded motif consisting of abott 60 

„»Jdr ^" _P4 (where L denotes a loop con- 
nects consecuti^ ^strands). A superpositicm of 
the three subdomains shows high d™ 0 fX°! 

^ (Figure 4). The r.nus. deviations beS 

t 10 21, 28 to 32, 41 to 45 in subdomain A: red- 
dues 72 to 77, 92 to 96, 103 to 107 in s^domate^ 

X^if to J*tt7 to 151, 159 tol^uS 

domain Q are 1.0 A hot™*^ . . * uc 7 

and ^ respectively. In contrast to the presence of a 

Hl^ tT^ 84 to 86 m o"r ?n modi 
. and WCI do not contain a ^Mix hTtto* 
* *» f» three^sidue ffiSs Si 

£d<Tfr^ J* 3.0-heHx In STT pro- 

trudes from the protein core (Fismrp n ST 

bond after Met84 1 suscepZe to de7vai^? Su ? 

Sh£ 3fter J Met84 ' * te «"» active asTbSht 
mhjbitor, and when both peptide bonds 

ment between De64 and Met84 does not dlssodate 
3(a)). ^ disu £ 



■ % w /*• <r a y 




ZL. °.*f of .STL The PI residui AnrfTS 



^ ^mty of th. loop ^ons by cross- 
Solvent structure 

p£ S"gE 0fOT ^ ib complex with 

r^X^j 1.' 1974 )' "o solvent structure was 
reported. In this study, all three struchm^f 
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tetragonal crystal forms and show WactOM lower 

«nng mt^ral water molecule is present in the een. 

water moS^pSr^M^ 
^ a wafer molecule which n^^^^S 

1991) is not observed in the S7T models. v 

Comparison of Inhibitor structures 

ft6 refined structure of STI in the STT-PP. 



Fignre 4. Stereo diagram showing 
^J^fP 02 ^ of C° atomt 
among three subdomains (A B 

and |reen taes represent domains 
Ar B and C, respectively. The 
(1 to 4) indite the four 
frstwnds in each subdonwin. 

plex (Sweet «t <1974), which were kindly su«- 

deference 'of 602 A. ^Ctt'^^S 
^udy. A summary of the superSS 



All 





[V430 

\*feB4 




dances betw^ ^ ^ ™ 
atoms of the inhfljftor am 
a hydrogen bond in parenthesis! 
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relatively flexible reactive site loop as well as the N 
terminus of STI becomes rigid upon its binding to 
trypan- Sraular behaviors have been previously 
observed for the reactive site loops of other inhibi- 
tors (Read & James, 1986). 

A C° superposition of STI models in the tree 
state a nd its STtPPT(ortho) complex gives an 
Tin* • chfference of 0.68 A far 171 C atom pairs 
(1.09 A for all 1262 non-hydrogen atoms). A plot of 
the distance between equivalent C atoms as a 
hmcbon of residue number is given in figure 7(a). 
Residues showing deviations greater than 1,0 A for 
^atoms are 22 and 23, 98, 111, 124, 128, 138, 144, 

the free SH and STLPPT(tetra) models gives an 
^j^f 08 of 079 A for 170 C* atom pairs 
^u 1248 "on-hydrogen atoms). A pl« of 
the distance between equivalent C atoms as a 
function of residue number is given in Figure 7(b) 
Residues showing deviations greater thanl.0 A for 



CV atoms are 48 to 49, 124, 144, lS 5, 166 and 167, 
*e large difference are 

^S^a* i~ T™ 1 l00pS ™ d * e C-terrninal 
residue Aspl77. These regions show high Sectors 
for Ujemauvcham atoms (Figure 1), indicating that 
*e coordinate errors of these regions are larger 
than the average. Thus large deviations in these 

^^V*^ due „ to *» ^ coordinate 
errors but they also reflect the inherent confor- 
r^bonal variability of these regions. In other parte 
of the chain, the magnitude of the deviation is 
roughly what is expected from the estimated coor- 
dinate errors. A superposition of the STl models in 
? n '^f l f K stmctun?s gives an rjns. differ- 
ence of 030 A for 170 C« atom pairs (0.75 A foTSl 
"hydrogen atoms). A plot of the distance 
between equrvalent C« atoms as a function of resi- 

& JESS' B *™ '"I 1 *** 7 ( c) - sh ^- 

mg deviations greater than 1.0 A for C" atoms are 
48 and 49, 111, 128, 155, 167, and 177. TrSTk^ 




HB3(P1) 



Figure 6. (a) Stereo diagramshow- 
>"g the superposifion of C" atoms 
among the free and completed STI 
T** 1 ^, Gwen lines represent the 
tree STI, while red and blue lines 
^present the inhibitor in STfcPP- 
T(ortho) and SnjPT(tetra) models 
fP^ely. (b) Stereo 'd^Z 
Shown* the sujx^oon d C" 
atomsainorig the STI femily inhibi- 
fQrs - Green, red and blue lines 
^oat SU ETI and WQ, 
J^^fy. Every twentieth resi- 
due is labeled. 
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4«U 



$n 

BTI 



ISSIHVTKUVCOHDEXDVIttOQTiaiHR 
* — CM-* ^ fn . 




Residue number (sm J 



J&£^££ equivalent e 

Atomsbeh^en ^ free and complex** SH models as a 



«-* 110 hi 

SIX DHDMtKSaffVSKN)9£,V\rQrQRLDKC$l 

TO D^K0TKKLWTEQy»LTVVuamCfl9 <40 3 tl 

^^g« for Sit secondary structure eternal 
S"f*£ M!d o3 m Rgure i. Reside conservedTfaU 
a* m bow and B* bullet (.) d££ 



jP°3»"2j«s contributions from the large eoor- 

different loop conformations due to different crys- 
taUin^roronents, as in the comparison betS 

snW^fff 5 "^ between the two modebVf 
S te^S of ^ orthorhombfc 

r^rSS^ of *« complex should be 

«grded as only minor. In particdar, an rnvs 

wUT be used for most of theltructure d^2 
below, unless otherwise stated. P ^ 

Comparison of free SV ufm ET1 and WCl 

structures fa showTin FiS^ J*^^™' 
identity between^I i 
that between PTr err B 57% ' wh «reas 

C deviations greater tharTs i.^tSn tTr^f 



Sf? 9 Z a ^deviation is 237 A for 
165 C« atom pans, with the residues 82 to 84 R* 

ai Wh2 IrT^f £^ deViafions S^«er than 
i When and WQ models are com™^ 
£ »»* deviation is 1.10 A feTss^S 
pans, only about half as laree as in Jl * • 
JJ «ft either En or 5£ftE *Tff3 
*e distance between equivalent C atonl « » 

figure s»;. other discrepant regions are lora^H 
mostly on the protrudirTlcoDs^n,^^^ 
«nere insertions or deletions occur 

In Ae case of ETI, the reactive site loon is dis- 
placed with respect to that of free SIT with l£ 
d^nce of 3.15Tbetween freC"Vton£tf n 
I** «»raponding distance betwee^ FT7 

loop of WCl lies between those of Stt ctt 

ler, with the distance of ISO. A betwfw, 74 
ffns of Pi residues in fre^STl SS^WO^S 

Sri S ^r*™ of ^ "active sWtaJS 
ET1 is largely due to the crystal r*ckml b?t J 

^ .oop. canit be'S&^^^^ve 
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R«Rlua number (Sn) 



Figure 9. Plot of the distance between equivalent C 
«oasl between theSH^ly inhibitors asTruS^ 
rradue number, (a) STLPPT<ortho) www EH, (b) Free 



the amylexation state. When C" atoms for P4- 
^nSffe.** ""P"^ the rms. deviation 
" 52 x J***™ *e free SH and En, while ft 
* 0 n *\ A the free STI and WCL C S>r- 

TKL***!?"* 1 6 ™* 0 ™ «• 0.36 A between 
the free SIT and STtPPT(ortho) and 049A 
between the free STI and SnWftebaT^ local 
S^f™* active ^ ijps of Sin 
and WO are, therefore, very similar and are little 
affected upon complex formation. 

^'!^!!1 S '^. , ? 0 P : ils conformation and 
— — n«»"(w • uunipi'exaiion 

foa^ wiuch possess an expc^ re^ ^ 



aim i of Asnp plays an important* role in stabiliz- 
^ 8 . 1 r E , r f C J hVe k°P <»nfonnation through a net- 
work of hydrogen bonds (Figure 10). Thte resuhs 
£*• "ST the^gion around AsnB 

£«ure 1). Similar stabilizations are also observed 
for ETI and WO, although the details tftaSES 
biding pattern are not identical For examphH 
water molecule resides in the reactive site of 
Wa forn^hydrogen bonds with the carbonyl 
O atoms of and SenS2 (Dattagupta ™T 
1996). An equivalent water molecufe is not 
observed in either the STI or the En structure. The 
backbone conformation of the reactive site loop of 
™£? upon forS a 

comple> !w,th PPT, as indicated by similar main- 
tain d*edraUngles for P4-P3' tesidues (TaS 

With HPT (Table 3). Table 3 shows that there fc a 
ferge delation of * and * angles at several pos- 

S ^TwS?. refined modek of STI and the 
7% °^?P ,nit3 «■»*«« with PPT (Sweet 

£± StJ^ 1*°?" *e conforrnatiS 
angles of several other inhibitors as well as their 
ammo aad sequences around the scissile peohde 
SS -^T BPT1, there is a ZffS 

*"g«e of P4 and <(, and ^ Zj^ f 
^: f °**Pf a , difference in the i»25£ 
ftt^ 1 ^ 4 ' ^position (argmine) for 
S>n and BPTT, both guanidinium groups point in 

el d., 1974). Between STI and MCTT, a large devT 

* angle of P2', whereas EH and WCI show similar 
conformations as STI at K. Notwithstandtal^S 




F^iostowdia^ showing 
the rob of Asnl3 in stabiiirin* q£ 
|£Crrv* site loop cMifeSffi ot 
?«. The conserved residue Asnl3 
^irjvolved in extensive hydrogen 
ponding with several reddue/in 
the reactive site bop. 
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Table 3. Comparison of dihedral angles {$/ jr) of reactive site loop residues 



P4 



P3 



P2 



PI 



sn 

ETI 

wa 
era 
Men 

FrccSH 
STI:PIT(tetra) 

shippixbiow) 
En 
wa 
BPn 

BPTfcBPT 

Men 



Pi' 



s 
s 
$ 

G 
I 

-112/149 
-114/144 
-126/149 
-58/127 
-88/171 
-125/-177 
86/176 
75/174 
-58/117 



P 
K 
Q 
P 
C 

-70/-28 

-S8/-34 

-67/-B 

-40/-39 

-66/-30 

-8S/-21 

-86Z-6 

-77Z-29 

-137/112 



Y 
L 
F 
C 
P 

-62/158 
-56/139 
-65/121 
-65/141 
-112/150 
-80/173 
-81/165 
-70/155 
-54/147 



R 
ft 
L 

K 

-84/21 
-89/38 
-70/10 
-93/47 
-66/134 
-111/76 
-104/9 
-116/39 
-91/34 



I 

S 
S 
A 
I 

-60/148 
-83/148 
-59/165 
-93/123 
-169/151 
-122/158 
-76/122 
-87/164 
-82/146 



R 

A 

L 

R 

W 
-90/-22 
-67Z-38 
-88/^29 
-165/-178 
-74/-46 
-89/-19 
-127/76 
-112/79 
-54/125 
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P3' 



F 
F 
P 
I 

M 

-127/126 
-U8/15S 
-115/170 

8/49 
-106/163 
-130/157 
-105/121 
-98/124 
-152/127 



STtPPT(Blow) f Sweet 4 el. (1974); ETT, OrtCsti et ol 
Marquatt <ff al (1983); MCTl Huang * of, (1993). 



0991); WO, Dtffcgupta et 4, (1996); BPTi Wiodawer * <d. (1987); BPTliBFT, 



atoxrw of residue P4 and ?V) is among the smal- 
tat (9.0A for Slj 9,2A for ETI, and 8,8 A for 
Waeompared with 113 A for BFII, 15.0 A for 
MCTI (squash family), 15.6 A for PSTI (Kazal 
fenaly), and 15.9 A for eglin c (PM family)). The 
reactive site loops of BPTT (Kunitz), Kazai/^uash, 
and PM families are constrained by disulfide 
bridges that could limit their coiiformational free- 
dom. No disulfide bridge adjacent to the reactive 
site is present in the STI and eglin c (Pl-1 family). 
Whereas (he reactive site loop of SIT is held in a 
favorable conformation by hydrogen bonds 
(figure 10), that of eglin c is stabilized by electro- 
stehc mteracUons between charged residues on 
both sides of the reactive site loop. 

The magnitude of overall structural changes in 
proteinase inhibitors upon binding the cognate 
proteinases ranges from very small to extremely 
krge. For the best-characterized BPTI, the r,ml 
nI^To^^ ^ ^ and complexed struc- 
tures is 0.26 to 017 A for 58 C atom pairs fflenm. 
er ihe rorus. difference between the ftee 

m SSI^btilisin BPN' complex is 0.63 A 
for 107 C atom pairs (Takeuchi et aL, 1991). At the 
other ^treme, the serpin family shows the most 
dramatic conformational change. The structure of 
cleaved ^-antitrypsin showed an unexpected sep- 

Ser359; Pl-PT) by about 70 A with incorporation 

(^ertnann et < 19M). A superposition of 54 C" 
atoms m the central 0-sheet of the cleaved and 
x •JF* Mt VP&i gave an r.m,s, difference 
of 2.4 A (Song « 1995). For STI, merml 
f n ^^ fi betw ^en the free and complexed irJubt 
tors is 0.68 to 0.79 A for 171 C* atom pairs. Tne 
dev^on for C- atoms belonging to pS"^ 
? e """J"* f te * waller than the 

mP^f^V °f A free STI anS 

™JZ ( ?£}°) a *<* 0.2$ A between the free Sn 
and SlT:PPT(tetra), 



Geometry of the reactive site carbonyf group 

The geometry of the carbonyl group at PI resi- 
due is of great importance in understanding the 
interaction between the inhibitor and the protein- 
ase i during the catalytic mechanism. In (he earlier 
and lower resolution structure of STJkPPT complex 
(Sweet * «/., 1974), a tetrahedral intermediate was 
ffl u .^.^P 0 ^ 1 W0S not < however, vali- 
dated by the subsequent hi^-resolution structures 

m ^£?&T» ot *y ^ght (Marquart 

t ^ 1983; & fiuber, 1992) or ne^S 
0^d cUf v 1983; Huang ef 19*5) ou^^e 
deformatians of the Pi carbonyl group were 
observed. Additionally, the carbonyl ffin atom 
^^i? b ,f Within me >b-vaivder-Waals" 
^^ ( % l ^ a ^ Und 27 A >*™** Serl95' 

^ ^ P ^(tetra) models, the distance is 
2.76 and 2.71 A, respectively. In order to obtain an 

^fijfT 1 &° m * i,c y of ^ e ?! carbonyl group, 
refinements were carried out either with or 
without applying the restraints to the carbonyl car- 
bon atom. With the usual restraints, the ou£f. 
piane distance of Arg63 C is 0-0084 and 0 0156A 
for Sn^PT(ortho)^ SThP^^^^l 

^T^J^ rl95 00 of W»wut the 

mvwJL^f^? 0024 ^ 0-067 A for 
SUPCT(ortho) and SHrPPTftetra), respectively. As 

l^ 01 *?? 9 *™* 1 *' ^^ente were also per^ 
formed after removing the restraints for the carbo- 
nvl carbon atom of P2' residue (Arg65). The out-of- 
plane distance of Arg65 C show? a cor^abk 
increase from 0.019 to 0.082 A for STT^rS 
or from 0.026 to 0.093 A for STI* fe25T2ffi 

chided ^attfie PI carbonyl group of SIT dispK, 
ft ou t-°^ e ^placement aJ^ul 

ft retams a nominal trigonal planar when «™ 
Flexed with FFT. This is in a^^thX 
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result for the MCTLPPT complex (Huang ol.. 
1993) and is consistent with STI belonging to a 
typical substrate-like inhlbilor iamily, whosl reac- 
bve site loop takes the canonical conformation in 
both Hie free and the complexed states. It is worth 
mmhotung that the out-of-plane distance of Lysis 
(PI) C ranges between 0.12 and 0.14 A for BPTI 
complexes (PDB ID codes 2 PTC, 1TPA, and 

Mod© of interaction between STI and PPT 

. ^ pattern of interaction between STI and PPT 
m F «°re 11(a) and is summarized in 
«i tlPlf 6 are wme , min or differences between 
the interachon pattern between the orlhorhombic 
and tetragonal crystal structures of the complex. 
Twelve ammo acid residues out of the 181 in STI 
make contact with PPT in the orthorhombic crystal 
T^i U £j he5r are: ^P 1 ' ^ Asnl3, Pro61 (P3) 
Tyr62 <P2), Arg63 (Pi), He64 (PI'), Xrg65 S 
^•.^TrpmandArgll^. hth^SgS 
crystal shi^rure, the three residues Ffis71, Pro72 

w£ J br ^ Mtogen bonding interaction 
site loop^idues from 
fto6I (P3) to Arg65 (P2 1 ) is well conserved 
' except that a hydrS 
r T * z ? ^ G W is not present 
m the tetragonal crystal structure (Table 4) 

. l i rth0r £ 0mblc ^J** 1 ^ STfcPPT com- 
plex, although grown under a condition different 



from that of Sweet rf at. (1974), has the same space 
group with nearly identical cell parameters. How- 
ever, the mteraction pattern observed in our 
orthorhombic structure is somewhat different from 

erence is mainly due to the fact that 
our model is more complete and is better refined 
at higher resolution, Th£ protein sequencirjdata 

W85) and the nucleotide semience of 

w ti^ 6 * i' 1991) amino 
aad residues at P4 and P3 positions in the early 
model of the complex, Pro60 (P 4 ) and Serfil (P3) 
are . e 5° n f > ^^e corrected sequence, Serfio (P4) 
and Pro61 (P3), was built mtaJur moddand I 2 
confirmed by the excellent electron densir? 
(Figure 2) The residues Pro60 (P4) and Phefifi (P3T 
tothe early mocHsuggested to form the region of 
fat^hon Wflt PPT, are not involved in mTinter- 
achon m both of our models. The two resides 
^ 117 .fl d ArgH9 belong to the segment 116 to 
was tentatively assigned in the early 
model of me complex and were not included in the 
residues parttcpabng in the interaction with trvD- 

more differences is given below. 

Most of the contacts between STI and PPT 
mvolve ft* five residues in the reactive Z 
(P3-P20. m our orthorhombk crystal structure ele- 
ven o« the 14 hydrogen bonaVLtw^S'^ 

m the tetragonal structure, ten of the eleven are 
accounted for. The PI residue Arg63 makes the 





Figure H.Stereodiagiflmahowine 
tt» interaction; (a) between STI and 
«td (b) between tTI and 

the inhibitor and the proteinase, 
respectively. All readues imS 
m the iiteracfon are labeled 

&3? ,mes dcnote 
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Table 4. Total interactions between STI aitd PPT 



FFT 



Aspl 

Asnl3 
ProSl 

Tyrt2 



Afg63 



Ue64 



ArgfiS 

Pn>72 
Tipll7 
ArgU9 



Lys60/ 
Pho41 y 
LysfiCT 

Hi$57 
Phe94' 

Uu99^ 
Aspics 

Ser214' 

AsplSy 

Serf*? 
Cysl9V 
Glnl92 / 
Glyl^ 
Aspl94' 
SerW 
Ser2H' 

GW2W 

GysZW 

Glym 

FheU' 

Cys4Z 

HisS? 

Glr^ 

Glym 

Serl^ 

His4C 

Phe4r 

Tyrl5r 

H1557 
Asn97' 

Leu9y 
Aster 

A$n97 



'No, of interactions 

STIFPKtetra) .J**, 



2. - 

3, - 

4,* (N02-OE1;3.46A) 
3,- 
U(0-N;3.I2A) 
16, - 

— ,l(OHt-q;2,97A) 
5,- 
1,- 
1,- 

1/- 

4,2(NHl-ODl ; :t70A) 
(NH2-OD2;3.»A) 
4,2(NHl-OG; 3 .nA) 
4,- 
4,— 
2, t (O-ho Z61 A) 

1/ — 
8,— 
U(N-0;3<G3A) 

1/- 
2 (NH2-0; 279 A) 
™> — 

1, — 

2, '- 
1, — 
I, - 

1, — 

^1C^^-0G;3.12A) 
M(NH2-Q ? 3.07A) 
- i CN-0;334A) 

3, - 

1, 2 (NE2-0; 3,08 A) 
1 / -•— 
t- 

2, - 



-,1(OD2-NZ;2.80 A) 
5,- 
3,~- 

4, 1 (CMC; 3.14 A) 



8,- 



1,- 

1,- 

<2(NHl-ODl;2.68A) 
(NH2-OT&2.95A) 
3,2(NHl-OG;*43A) 
5, — 

4, — 
2,1<0-H-2S4A) 

# ^™ 

5, - 

— , 2 (N-Q; 338 A) 
1/ — - 

— , I (NH2-Q; 2,82 A) 
3, — 

1,_ 

1,- 
3,- 

1, - 

l,I(N-OG;Z89A) 
1, 1 (NH2-Q; 2.99 A} 
— ,1(N-0;3.41 A) 
6,- 

2, - 



5,- 



ion 
a -a 



km 
ion 



'A n' " M(NE-ODi;3,2lAl _ 

eoge-tace inteacboq a-a- a«rmatie-NH group interaction. 



most ^tensive hydrogen bonds with PPT, forming 

Ag» occuptes ita ^pasted position in the primary 
*f ? f PPT. Tto guanidiurn grouTof 
A^63rnak8S an ionic interaction with thecarboxy- 
bte group of AspW in PPT. Hydrogen bonds ^ 

& ?i fr 1 ^ *ns 

«ain of Arg63 and the side-chain oxveen atom 

GtyZiy. Ttus differs trom the hydrogen bonding 

KLT Tt f? P/^^y (Sweet I *, 1974? 

b^^i CS i^f hain0fT y 162 ^ * Phoned 
between the side-chains of Leu99' and HisST lvm* 
parallel to the imidazole ring of the Utter [I 

form a hydrogen bond with the sidlS^ ^ 



^97" In the previous report (Sweet ei oL, 19741 
Even though the residue Prool (P3) was en^ 
neously assigned as Ser fa the early modeLd^ 

^^f^anglesatthispJtio^aablet 
andthe 1 hydrogen bonding interaction between its 

S ° *«™ and the maintain NaZ J 

(P2') interacts with TyrlSl' fa PPT wjssihlv ftST 
I s h jl«*- bond b^eeniuTbSe^ 

i*uuyz of FPT belongs to a stretch of eieht 
dues strictly conservj betweTI^ aS ^ 
ffigure 12) and interacts with as many as four 2? 

«^ us cor^rrnation is of great importance fat£ 
prediction of ETLtPA cornolex on^Zil u 
dogy modeling. Its ^^o^^T* 



JUN 2? 2^03 3:38 PM FR CISTI ICIST 



613 993 1395 TO 17188899320 



P. 14 



358 



Stmctvm of Kun'ttz-type Soybean Trypsin Inhibitor 



CHY XWCWAVPCSWWgVSLQMCT— GfKFOQaSLXHCBWTVZMflCCV TT 

m ivconcawis:infBWi,(i3 CSHrce«ia*$girwsjuuicYK s 



M * » » too uo 
CKY aovvvAKFOQGMsmaKi^wewra amp 

PIT I DVLESUEQFISAflia ITRPDfVCNtLCMDZKLTKlS^- — tXTV 

**A H tTVl MOT t*WPCaEE$« fgVg>CTIVHyEFTOOT YflWUJ. Ai l j f <^ r ^ 03500^9 

• lid 

190 KO uo uo 

PPT M»*v*IV5LEK-SOL-MS*e£M WW.WKI. SCfiS^TS UQCLMPVL5 W3S«K • 

179 '* 'W 117 200 )16 

CHV -KYlfOTJU KOWG CACA2 G«SSO*CMecoa»citB»CN*TXVCIV*»5 

PPT •SSYPCQXTeMtXCtferiBQ "<»B«OCD»gO¥VVQH--.W)W)OIVS*IC 

— • 

220 m HO «5 

CHV SrrCS -TSTVGVYAOTTALVttllVOQTUAII 

mt v-ocAQRwwvmcvcKyvinaQorAAp 

*TA L-GCGgtOtftarTT Wf| HI U W I RflgMRP- 

Figure 11 Sequence alignment of serine proteinases 
based on their thiee-dirneraional structures. The 
abbreviations are as follows. Bovine chymotrypsm 
(CHY), porcine pancreatic trypsin (PPT), and human tis- 
sue-type plasminogen activation (tPA), The numbering 
schenw is according to the sequence number of CHY 
Ihe bullets (•) denote the positions of catalytic residues 
Hus57, Aspl02 and Serl95. Several loop regions (L) of 
tPA are indicated. 



be assigned in the early model of Sweet et al 
(1974), because the electron density for the side- 
chain was not long enough and there was little 
room for the side-chain in the complex. In contrast, 
the side-chain of ClnW? in FPT is well defined by 
the electron density in our models. This residue 
wasjuggested to form a hydrogen bond with the 
P4 residue of STI (Sweet et a!., 1974), but this inter- 
action is absent in both of our complex models 
^ A . c ?^ contact between the aromatic rings of 
Kie2 in STi and Phe41' in PPT was noted ^rW 
(Sweet et al., 1974), with 13 interatomic contacts." In 
our structure, die two aromatic rings make an 
edge-face interaction, resulting in much fewer 
interatomic contacts. This kind of aromatic edge- 
face interaction is energetically favorable and isthe 
most common among the observed aromatic-aro- 
matic interactions (Burley & Petsko, 1985). A simi- 
lar interaction was found in the crystal structure of 
chymotrypsin complexed with 6-benEyl-3-chloro-2- 
pyrone (Ringe et al. 1985). The ionic interaction 
between Aspl and Lys6C was also noted earlier 
{Sweet et < 1974) but the side-chains were drawn 
unreasonably dose together by the real-spac* 
*ennement procedure- In our models, they are 
much better defined and there is no extensive van 
der Waals contact between them. 

A structural basis of tPA Inhibition by ETI 

(Collen & Lijnen, 1995). Figure 12 compares the 



amino acid sequences of chymotrypsm, PPT and 
tPA, as aligned on the basis of their trtree-dimen- 
sional structures. In this paper, residue numbers of 
all serine proteinases follow the chymotrypsin 
numbering for consistency. Human tPA shows a 
sequence identity of 41% against PPT. The recent 
crystal structure of recombinant two-chain human 
tPA showed that the Si pocket of tPA is almost 
identical to that of trypsin, whereas the S2 site is 
c^derably reduced In size (Lamba et al, 1996; 
PDB IO code, 1RTF). ' 

Although the three inhibitors STI, ETI and WCI 
show a high degree of sequence identity (Figure 8) 
only EH ts capable of mrubiting tPA. In order to 
gain a better insight into this specificity, docking 
studies were undertaken using the model of Elf 
after manual correction of the scissile peptide bond 
^JfiL^ canonical ctinfonnation (Lamba et al, 
1996). This kind of docking studies will benefit sig- 
nificantly from a high-resolution structure of a 
Kunitz-type trypsin inhibitor in its complex with a 
cognate proteinase. Our highly refined model of 
SIuFFT complex provides an excellent template to 
build a homology model of ETLtPA complex The 
hypothetical models of both ETTrtPA and STI'tPA 
complexes were subject to energy inmimization, 
alter the following changes were made to STI to 
rnuxOc ETI: removal of Aspl and replacement of 
Phe2 by Val and Arg65 by Ala. The results of our 
rrwdeling studies largely confirm those of the pre- 
vious docking studies (Lamba et al., 1996), which 
indicated that the exposed reactive site loop of ETI 
should slot into tPA's active-site deft from the P3 
to the P2' residue. Fuxtherrnore, potentially signifi- 
cant ionic and hydrogen bonding interactions 
involving the residues outside the P3-F3' segment 
have also been indicated by our studies 
(Figure 11(b)), in addition to the interactions 
between the reactive site loop of ETT from the P3 
to FT residues and tPA described previouslv 
(Lamba et al., 1996). p y 

N-terrnina] Vail ammonium group of ETT was 
suggested to be located dose to the distal polar 
groups of Glno^ and GhiWA with the Vail side- 
ctan ar^dung the Arg39y side-chain of tPA 
(Lamba et al., 1996). Our modeling studies indicate 
that the Vail side-chain is located above the ioni- 
cally mteracting pair of side-chains of Arg39 / and 
CluoffA, but the side-chain of Gln60' points awav 
from the Val] L side-chain (Figure 11(b)). This inter- 
action with GhrfO'A explains the much weaker 
bmding of the mutant ETT with an extra aspartic 
?™ *1™\ N t f rmmus fTeixera et al. 1994) This 
fcteraction is abo parti v responsible for STI and 
WQ being unable to inhibit tPA. SIT has an extra 
aspartic add residue at the N terminus and WQ 
extra aspartic add residues (Table 5). These 

r^rt" 3 l 81 !^ WCI * * e * terminus 
L^l fr °£ £P of molecule, when 
^S^t^L^F 11 modeL ^ protruding N 
f^^^STI and WO may restrict <Lr 
access to the active site of tPA by fnaking a dos£ 
contact with the characteristic^ inseru4 £j 
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Table 5. Residues of E71 involved in its interaction with tPA 



P3 



P2 



PI 



PI' 



(EH No.) 

En 

ETTb 

(STlNa) 
SIT 

wa 



VZ PJ' 



61 
R 



-1 



1 

D 
D 



2 
F 
D 



61 
P 
Q 



62 
L 

h 
F 

62 
Y 
F 



63 
R 



63 
R 
L 



64 

S 
S 



64 
I 
5 



69 

I 

65 
R 
L 



66 

I 

F 

66 
F 
F 



D 
D 
K 

69 
E 
R 



70 
0 
D 
G 

70 
G 
G 



71 

e 

D 

S 

71 
H 
S 



72 
K 
K 
L 

72 
P 
I 



116 
K 

g 
K 

119 
R 
K 



135 
K 
K 

139 

Q 
D 



(60toop) as well as by making a repulsive ^ 
mterachort Ella from Erytkrim vorilgata, belong- 
ing to group c has the same valine residue at the 
N terminus as EH and shows the inhibitory 
a ^ .«*• In contrast, the N-termirS 

inhibit tPA, even without extra negatively charged 
™^ at *" N terminus (Table S^Thfc fclK 
vft^r 7 *f ^interaction of the N-termiri 
Vail of EnwithGWAintPA. 

There is aoronounced difference in the surface 
S^T^^ m ' des P ite similarity intfSr 
overall three-dimensional structures (Figure 13) 
Another ngon of fi>A important for l3*l£ 
■ This p-hairpin loop protrude 

considerably from the active site of tPA Wther 
wth Glu93 '- *e side-chains of As P 95',A S p^' S 



Aa»97' form a highly addic surface patch in the 

Asp9^Asp96' and Asp97' interact with the po^ 
r fa vdy-charged side<hains of Lys72, Lysm^nd 

which do not show inhibitory activity toward tPa! 

nZlfr and Lysl35) art 

replaced by neutral or negatively charged residues 
wWle a positive ^rgeTretain^^^^ 
c^respondmg to LyslW of ETI (Table 5). Thesk£ 
chata nrtrogen atom of Ly$72 makes salt brides 
w fU ade ^ m 0xv S en aton w of Asp96' and 

2Erf? ^ P SL^- ™ e rfd^haSnTtrogS 
atom Of Lysll6 in ETI also interacts with theS 
A* o MT in tPA. However, this indent 
may not beunportant in the selective inhibition of 
*PA by ETT, smce the corresponding residue 




GRASP (Nicholls, 1992). **** acidi c patches are labeled. This ^Sfed^^ 

using 
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Ai£ll9 m STI is predicted to make a similar into- 
f**™ witix Asp96' of tPA. The side^iain of 
LysI35 in En makes at least three hydrogen bands 
with the side-chains of Asp95' and Thr98' in 
addition to the ionic interaction with Asp95' 
(Figure 11(b)). p 

A mutagenesis study on ETIa from H. variesoia 
showed tiiat the tPA inhibitory activity of the 
smgte mutants R61P (P3) and L62F (P2) b sterol 
cantly reduced and, furthermore, the double 
mutant R61P/L62F (P3/P2) lacks the tPA inhibi- 
tory activity, despite retaining the trypsin inhibi- 
tory activity (Kouzuma et d., 1997). These mutants 
were designed on the basis of the sequence differ- 
ence between ElTaand ETTb (Table 5). The euani- 

•J^ p 0f Arg61 m m hydrogen bonds 
wito the mam-chain oxygen atoms of Argl74' and 

£E w% A The « ^idu« of 

sn ajwl WO are proline and glutamic, respect- 
nrdyjBoth crystal structures of STIf PT(ortho)^nd 
^PflXte&a) show that the sldeW, of Tyrf2 
J* 2 '*? ST L^ es * van der Waals contact With 
I f" m PPT- Interestingly, the interacting side- 
chains are switched in the ETLtPA complex and a 
comparable interaction is possible between the 
side-chams of Leu62 (P2) in ETl and Tyr9? in tPA 
A unique feature of FIT (and ETIa also), as com- 
pared with the Inhibitors which do not inhibit tPA 
a eJ"?**^ °* con secutive acidic residues' 
Asp69, AspTO and Asp71 (Table S). However, its 
sj^cance e not clear, since only Asp71 makes 
hydrogen bonds with the side-chain of TvrW 
A "«^t A65Y (P20 of ETIa from E. varicZa 
exhibited the same level of tPA inhibitory aS 
as me wJd-type EITa. (Kouzuma et al„ 1997). Mod' 
ehng indicates that an aromatic ring of tyrosine at 
^iKf^ n ? 1 L mtei ? ct fav °»bly with that of 
TyrlSV m tP A through an edge-face interaction, 

£L^Sr?T, m0d f 0f aromatic-aromatic 
mteracoon f^urtey & Petsko, 1985). Other residues 
of ETI which are possibly involved in the inter- 
action with tPA are GInll. Asnl2 r.h,v* ^ 
rhelll. Table 5 lists the residues of ETI which pfcy 
an important role in its interaction with tPA 



tfwwbanee at 280 nm for the 1.0 cm path length. Cry* 
tallizafaon was achieved by the hanging em» lapordif- 
nioon method using 24-well tissue cufim* plataT (How 
Laboratories) at mom temperature (22 ± VQ. The haxic- 

?*? bv «q«al volumes of tfe 

proton solution and the reservoir solution. The sriiPpT 
complex have been crystallized in two crystal forma. For 
mc : orthogonal form, the reservoir solution is 30% (v/v) 
fff ^."g MmM lithium sulfate in 100 mM 
at a final pH of 7*7. Crystals of typical dimensions^ 

S;ZL X i fTl 1 l mm within a w «ek- 
the tetragonal form, the reservoir solution is 1.70 M 

ammonium sulfate and 10% (v/v) dioxane in 100 mM 
Mes at a final pH of 5*8. Crystals of typical dimenstons 
S^uT* * in'"^ * O- 4 =™ grew within a week. The 
^teUeday^^te, condition for the free STI has 
been described previously (Lee et at, 1993). Crystals of 
dimensions of 05 mm * 04 mm * m 
about 25 days. ™ x 0,2 mm S^ w w '*m 

X-ray diffraction studies 



Materials and Methods 

Purification and crystallization 

S^andporone pancreatic trypsin (T-7418) were pur- 
^ J 5ed Sgna. Fc* (he complex formation, STI 
<^™8> J" 1 .WT (102 mg) were dissolved in So mM 
sodium chloride solution buffered at dH an «rfSJ 
tOOmM Tna-HGl and w«e feft^ h TOs^ 
uhon was loaded on a Sephacryl S-200 HR <re filtafcm 
™ x 76-0 cmj Jhk hid b^iSS 

KS^.^ pleX Wete ^ concentrated 

fiCi°^ /m) ^ lcentralkm (1* ml volume) by ultra. 

^£ P ? tem concentration was estimated by 
•nwaunng the ab^ance at 280 nm, aBumiSTconi 
spondence of l.Omg/ml c^tauonTVe^U 



.I?' ?"? 7 , data °<>"«*«a a crystal was mounted in 
a thm^altedjlass capillary and both ends of the capu. 
bry were OUeH with the mother liquor and then seakd 
with wax. Three sets of data were collected at three 
places usa^ difforenttypes of detectors. X«y diffraction 
T 5n have been dcscnW previous^ 

taTabk >6 B^WKy of data collection is given 

Date from an orthorhombic crystal of STtPPT com- 
f^TT a MAR image plate X-ray detelSr 

at the beamtme X12C of the N^dSyndmZ^l^t 
Source^ BrooWaven , National Laboratory (Sw«t 19ffl). 
Ail data were reduced and scaled usine DENZo 

TJ«c^4ffracied to beyond 1.7 A resolution using 
2^roho«j X-rays. Despite much difference in crystal 
atran conditions, our orthorhombic crystal of the com- 

aystal of STI:Ppr implex 
were collected at 14«C usmg a Weissenbcrg camera for 

19911 a ^77„." """OV japan (Sakabe, 

r^^fnLr UTlf fTl m cr ^ taL ^ illation 
mnge per image plate was 3.0', with a speed of 2JP/ 
0 Vi« ~ t of ^ /OTn . iC^la^of 

pftes W^Tl^^ ^ «»tiguous image 

dt^lrc^uL^ BA ^ 
(HigashL .A'SSte 

group was later found to be correct on the basis of 
«wnslat«m function calculations. 

Molecular replacement 

^1^) a^AM^^vaS 

me L^^™ F ° >2 °* WG « used throughout 
me molecular replacement and subsequent refiJ™,, 

™ ™f 811 structure was attempted, usine the r,». 
viousiy reported model of ETTas the 3 sSLL™ 
(Onesti rf al 1991- pnn m r^P^ structure 
FUB ID code. VUE). A consistent 



JUN 27 20,83 3:39 PM FR CISTI I CIST 



613 993 1395 TO 17188899320 



P. 17 



Structure of Kunitz-type Soybean Trypsin inhibitor 



361 



Table 6. Data collection statistics 



SnOTT((etra) 



sn 



X-rays 

No. of crystals 
Resolution limit (A) 
Space group 
Cettaxe5( a/ fr,cinA) 
Asyanmetric unit 
V M (A7Da) 
Solvent content (%) 
No. of total reflections 
No. of unique reflections 

Data completeness (%) 



1,000 A (N5L9") 
175 

58.91, 6233, 151.46 
1 (STLfTO 
3.16 
61 
419/133 
S2>687 
7.8 

913 (5O0-1.75A) 
6^60,80-175 A) 



' Seoul National Universily, Seoul, Korea. 



UX»A(PF*) 
1 
1.80 
P4,2,2 
62.45,62.45,229.11 
1 (STLFPT) 
236 
32 
118,553 
26.956 
79 

62.0(600-1.80 A) 
37.2fl.9Q-l.80A) 



USA. 



CuKa(SNU<) 
1 

2.1 

«,2A 

39.42, 58.08, 9635 
1CSTI) 
2.75 
55 
27,86* 
10238 
4.3 

759 (3&0-Z10 A) 
267(2j0-2.t0A) 



rotation and translation solution was obtained for differ- 
ent ii«olution ranges. And the refinement of the free STI 
n^wnvj^ slowly to an Reactor of about 25% for 

inJ^Jld * h ' dH * ^ data from an 

orthorhombic crystal of OTPPT complex became avail- 

* "*" ,d b V ™>lecular replace- 

TeA EuSJ'i ° f Pepsin refined at 

1.6 A (Huang el 1993; PDB ID code, 1MCT). After 
renting and positioning the PPT model done according 
to the molecular replacement solution, an ft-bctor of 
4X4% and a correlation coefficient of 0.50 were obtained 
for 10.0 to 44) A data. A model for the complex was then 
senerated by positioning (he partially refined model of 
lIvT"^ 3 ins P« ction of *e electron den- 
fX%* ffi> »** ■"Action between trypsin and the 

Zzf^ Cc ! rim *• 511 " toe complex 

unpmved dunng the successive refinement steps and the 
^1« structure was refined to an K-fector of 18.9% for 
™ , dala - Now ' fcgNy refined model of STI 
m the complex wasuscd in turn for a structure re-deter- 
MSV V"*,^; " « ave » <^ rotation solution 
&™ rJ '™ 1,45 ' ): = 10 - 49a ) ™a « tansktion sol- 
t^on (x = 0OT9, y = 0.2867, z = 03261 in fractional 
cw^tes). After the STI model was oriented and posi- 
S.J^rt to the above solution, it gave an /fac- 
tor of 3*6% and a correlation coefficient <3n.73 i™ i-> n 
ro *u A oata. 

The starting model tor STU>FT(tetra) was the 
refined model of STI:PPT(ortho). The search for tf£ 
n^lecuJar replacement solution was performed simj- 

P4A2 by translation function calculations. For 10.0 to 

^wl?' t ^" faCtor t 0f 35 " 6% ■*> « correlation 
coefficient of 051 were obtained for P4,2,2, compared 



resolution data being added in steps. During this 
^'J^ dropped to 275% for 8X1 to 25 A 

data. Tlus model was men subject to a simulated anneal- 
ing refinement employing die standard slow coolina 
protocol: from 3000 K to 300 K (time-step 03 fs/dm£ 
ment of <*mr*rature 25 K; number of steps at each tern- 
perahue 50). Solvent molecules, modeled as water, were 
located as high peaks in the (F 0 -F) maps and were 
included in the subsequent rWS oTrXm^ 
2 water molecule was removed when its temperature 

tons up to 175 A were added in steps. Further simu- 
!SL?T£2! r ! fincment ^ the refinement of 
TS I 1 . aWm * gave the final 
f ^, f «. 8 9% for 3l ' 03fi "^que reflections with 
a£ ^.S^ 8 '° 40 V 5 A. The final free R-factor 
u 21.4%. The refinements of STtPPTftetra) and free STI 
models were performed similarly. A summary of the 
refinement statistics is given in Table 1. 

Model building and structure analysis 



Refinement 

,„ T if STfcfTTfortho) model was subject to refinement 
&*£ pf0Sram X ' FL0R vision 3.1%ar^Tl992> 

f^l* «o further improve the positional^ 
o^tetional parameters. The higrwescJuHoTLit of *b 

I! A jVL m Ct ° r "i* 15 sta f e was »•» * for 8.0 to 
f£vL1f£ I N<a,c '. aton '« P««&™ we* refined by the 
eoilvenbonal conjugate gradient mirumiaaSorv witft 



r J^P r °8^ M i H AlN version 5.4 running on a Silicon 
Graphics Indigo* XZ workstation was used far ™£l 
rcmmom S laacK, 1988) and the model stereocheznistr^ 
^^sed by PROCHECK (LaskowsSTSTT^S 
Statural coinpansons were made using the program 

I^resbury Uboratory, 1994). The orthortW.ic crystal 
sta^ of SpPT complex was used as a tempEte to 
buOdahomology models of tPA complexes wift mMbt 

n£(vria?'J?l l P A code : 1RT f) was superposed with 
rrj (wth an r.m.s. deviation of 0-92 A far im r« 

^ *~cture (OnestTeTl^S Sbd 

Sod*X^±? ^P^posed with STI. The resulting 

wf™ , JSL** < ^ np,exes between *e inhftitors and tPA 

OiARMM dynamics in the program package QUANTA 
^ruSt ^ GRASP (Nwhons, 1992), and MOI^ 

Tte l^?J^*Z Z ? 0tkSt * ti<n display. 
P^teK e u^^^^« M 
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Amino Acid Sequences of Kunitz Family Subtilisin Inhibitors from 
Seeds of Canavalia lineata 

SMgeyuki Terada, Hideki Katayama, Koichiro Noda, Satoshi Fujimura, and Eiji Kimoto 

Department of Chemistry, Faculty of Science, Pukuaka University, 8-19-1 Nanakuma, Jom-ku, Fukuoka 814-01 
Received for publication, September 14, 1993 

Ihe amino acid sequences of two subtilisin inhibitors (CLSI-II and -HI) from Canavalia 
lineata seeds were determined by manual Edman degradation using the DABITC/PITC 
double coupling method after enzymatic digestions and CNBr degradation: CLSI-II and -in 
consist of 190 aud 183 amino acids, respectively, and have identical amino acid sequences 
except for in the C-temnnal regions : an elongated sequence, Gly-Thr^De- Arg-Ser^-Glv 
was found at the C-tenninus of GLSHL A short-chain analog protein without an N.' 
terminal Asn residue was also detected in each inhibitor preparation. The inhibitors 
showed significant homology to Kunitz type inhibitors, but differed from them with respect 
to the half-cystine content Among five half-cystine residues present in CLSI-IH, two 

r^due, Phenylglyoxal treatment abolished the inhibitory activity of CLSI-IH, in<&atin* 
the participation of an Arg residue in the interaction with the enzyme. The reactive-sito 
peptide bond was deduced to be Arg^Glv". 

kSbiSr? 1 amin ° S€qUenceS ' Ccmai7aifa lineata, Kunitz inhibitor family, subtilisin 



Subtilisin inhibitory activities have been detected in many 
legume seeds (1-3). Subtilisin inhibitors from broad bean 
(4) and adaiuki bean (5) have already been sequenced. 
These have low Mr of about 8,000 and belong to the potato 
1 type inhibitor family. Bifunctional inhibitors that inhibit 
both subtilisin and or -amylase simultaneously have also 
been purified from cereal plants, end their sequences 
showed homology with Kunite type inhibitors {6-8). 
Kumte : type,proto^ 
sfee*fA&^21iO0O-^^ 
4>onds;(4%df^^ 
^fb^erine^ 

Subtilisin isoinhlbitors of Mr 8,000 have been purified 
from jack bean (Canavalia ensifhrmLs) (3), but their amino 
acid sequences have not been reported yet! "Recently, we 
isolated and characterized two subtilisin inhibitors, CLSI- 
II and -m, from seeds of a similar bean, C. lineata (10). 
These were considered to be Kunitz type inhibitors in spite 
of the presence of 5 half-cystine residues, unlike in all other 
Kunitz inhibitors. We have herein confirmed the classifica- 
tion of CLSI-II and *m by deterinining their complete 
amino acid sequences. These are the first Kunitz type 
subtilisin inhibitors from legume seeds of which the 
primary structures have been elucidated 

MATERIALS AND METHODS 

This portion of the paper, in addition to Figs. 1S-10S and 

Abbreviations; CLSI, Canawlia kWa subtilisin inhibitor, DABI- 
TC/PTTC, 4-Mtf-dimcthyl^naa^^ 

phoiiy^th^anate^OT, Kunite soybean trypsin inhibitor; », 
S-pyndylethyiat«i; CM, 5-carboxymethylatwi; TFCtC. t-U-tosyl- 
amido-2-pheayi)ethyl chlor^methyl ketone. 



Tables IS-VIS, is presented as a mmiprint at the end of this 
paper. 



RESULTS 

N-Terminal Ammo Acid Sequences of PE-CLSI-S and 
-iZT-The N-terminal residues of PE-CLSMI and PE- 
CLSI-m were analyzed. In each cycle of manual Edman 
degradation, aim oat equal amounts of two amino acids were 
detected for both inhibitors* For example, PE-CLSI-Hgave 
^ *W» of^) ; (D,y)-(V,D)-{D,V)-(V)-(V,M)-(M ( 

VXUtA) This clearly indicates that two peptide 

^ h 2 inS ff 6 P"* 8 ®** ^ taMWtor pw|»i«tidn # one of which 
is nevoid of the ^terminal Asn residue. The same phenom- 
enon was observed on the sequencing of all peptides 
containing the N-terminal parts of the inhibitors. Never* 
theless, the N-terminal 24 residues of CLSI-II and 18 
residues of CLSI-m were identified (Pigs. 1 and 2), and the 
sequences were identical. 

Enzymatic and Chemical Digestion of the Seduced and 
Alkylated Jnfc&itors-Feptides generated on iysyl endo- 
peptzdase digestion of PE-CLSI-II were fractionated by 
reverse-phase HPLC on a a 18 column (Fig. IS) and then 
subjected to amino add analysis (Tables IS). The peptide 
peaks named K5/6 and K7 represented mixtures of two 
peptides. K5/6 was rechromatographed on the same col- 
umn using different gradient conditions to give two pep- 
tides, K5 and K6. On the other hand, K7 was further 
digested with TPCK-trypsin without separation of the 
constituent peptides. The ehition profile of the tryptic 
digest is shown in Fig. 2S. The amino acid compositions of 
the purified peptides (K7T1 -K7T5) are also cited in Table 
IS. The digestion with V8 protease gave ten major peaks 
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Fig. 1. Summary of amino 
add sequence analysis ofCLSI- 
H. Peptides derived from CLSMI 
on digestions with trypsin (T), 
lyoyl eodopeptidwe (K), $. aureus 
V8 protease (V) , and CNBr (M) are 
shown above the sequences. Sesi. 
dues not identified are indicated by 
dashes. The N-tenjiinal part of the 
inhibitor was determined by di- 
rectly sequencing FE-GLSI-H and 
is shown by on underline. 



EIDW™!GGraW0VPSCTFTlO& 

* |ff *KACVG tGGHEDHPGEO — 
(K5 



1 "ilOW«e^ILTHH&»F< 



FD 



* ^ACVG J GGHEOHPGCOVf SG ff T | OX^^^^^^LVTCESOSSTCSO I GRYOftNEGGRRL* - • 



V2 V9 

Ml 

OASTFO 



Pig. 2. Summary of amino 
acid sequence analysis of CLSI- 
HI- Residues not identified are 
indicated by dashes. The peptide 
nomenclature is the same as in fig. 
1. Two disulfide bridges and a 
single cysteine residue are shown 
by solid lines and an asterisk, 
respectively. 



(V X-V10) on HPLC (Pig. 3S). The amino add compositions 
of the peptides are summarized in Table IIS. Chemical 
cleavage with CNBr gave four peptides, namely, Ml, M2, 
M3, and M4, on fractionation by reverse-phase HPLC (Fig. 
4S), Table mS shows the amino acid compositions of these 
peptides. 

A slightly modified procedure was employed to digest 
CLSI-m. PE-CLSI-m was subjected to CNBr cleavage and 
the peptides were purified as described above (Fig, 5S). A 
large peptide, M4 in Fig. 5S, possessing the sequence of 
PAIWGPPVRLAKTRNSDCPV-^, was furthe/digested 
with Iysyl endopeptidase to give eight peptides, M4K1- 
m RiF* . 6S) ; CM " CL SI-ni was used instead of the 
^inhibitor for V8 protease digestion (Fig. 7S) . The amino 
acid compositions of peptides derived from CL3I-IE are 
cited in Tables IVS and VS. 

Complete Amino Acid Sequences of CLSLH and -Jil— 
The complete amino add sequences of CLSMI and -IE are 
shown in Figs. 1 and 2, together with details of the 
overlapping peptides from which they were deduced The 
locations of peptide K2 in CLSMI and M4K2 in CLSI-IH 
were determined to be exclusively at positions ranging 
from 130 to 138. The sequence of CLSMI wntaked 190 
anitoo add residues corresponding to 20,7700a, while 
LL3LJH consisted of 183 residues corresponding to 20,023 
Da. The total numbers of amino add residues in the 
sequences were in good agreement with the amino acid 
compositions of the two inhibitors, as described in a 
preceding paper (10). The sequences of the two inhibitors 
are different from each other only in the C-terminal parts of 
tueir molecules. The heptapeptide, Gly^-Thr-Ile-Arg- 
Ser-Asp-Giy 190 , is attached at the C-terminal of CLSMI. 

Location of the Disulfide Bridges— Five cysteine resi- 
dues are located at positions 44, 88, 142, 149. and 106 in 




Reaction Time (hour) 

** ?. Modification ofarginine residues of CLSI-m and SXT 

The mhibrtore were treated with 0.13% phenyklyoxal in 0 2M 
iV^ylmorpholine^cetic add buffer (pH8,0) at 25'C for various 
penod£ and then the remaining inhibitory activities of CLSl-IH (O) 
d£2& ^ ™** BPN ' ««i trypsin, respectively!^ L2 



toth inhibitors. To locate the disulfide bridges, unreduced 
Offl-ffl; was digested directly with trypsk On^S 
Sf 6 ^ 0, j^^e-containing peptides, namely, 
NTl, NT2, and NTS were obtained (Kg. 8S). The amino 
aod compositions of NTl, NT2, and NTS corresponded to 
residues 139*154, 41-62 and 85-93, and 105-129 resric 
tively (Table VIS). The sequences ^^pSsXi 
also confirmed by Edman degradation tota S f 
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nti^lVf-x-e-s-d-s-s-t-x-s-d-i-g-r 
nt2, n-s-d-x-p-v-t-v-l^q-d- 

K-P-E-X-A-S-S-S-K 

109 112 

NT3. A-X-V-G-I-G-G-H- 

The results indicate that two disulfide bonds link Cys" to 
Cys« and Cys M * to Cys u «, Cys 10 ' being present as a free 
Cysteine. 

Modification of Arginine Residues — The relation be- 
tween the reaction time with phenylglyoxal and the resid- 
ual activity of CLSim is shown in Fig. 3. The subtilisin 
inhibitory activity decreased significantly with the reaction 
time. The inhibitor incubated similarly without phenyl- 
glyoxal retained the full activity after 24 h. A soybean 
trypsin inhibitor (STI) possessing an arginine residue at the 
Pt position of the reactive site (19) was treated similarly 
for comparison. The trypsin inhibitory activity decreased 
with the progress of modification of the arginine residue. 
This suggests that the arginine residue of CLSLIQ might 
play an important role in the interaction with an enzyme. 

Modification of the Reactive Site— To directly locate the 
reactive site, CLSI-IE was treated with 68 mol% subtilisin 
BPN' for 1 h at pH 7.6. The inhibitor was separated from 
the enzyme by reverse-phase HPLC on a Biofine RPC-SC18 
column, and then reduced and S-pyridylethylated. Frac- 
tionation of PE-CLSMD gave three peaks on the same 
column (Pig. 9S). The yield of the modified inhibitor was 
about 5%. The amino- and carboxy- terminal sequences of 
fraction C were determined to be Gly-Ser-Gly-Leu-Ile-Ile. 
Thr-Asn-Thr- and - Ala-Ser-Thr- Phe- Asp, respectively. 
Fractions N and I gave Asn and Asp as the arnino-terminal 
amino acids. The amino acid analyses showed that fraction 
I is intact PE-CLSI-HI, and that N and C correspond to 
residues 1-68 and 69-183, respectively (data npt shown). 
This suggests that the reactive site of CLSI is Arg^-Gly 6 *. 
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Fig. 4- Comparison of the amino acid sequences of Kunitz type 
inhibitors. DB-3, Erythrina coffin trypsin inhibitor (24)\ ETIb, £, 
variegato trypsin inhibitor (26); STI A, soybean trypsin inhibitor 
{2$); WTI.U, winged bean trypsin inhibitor (27); (f) WCI-3, 
winged bean chymqtEypsin inhibitor (2$); BASI, barley eubtilisin/ 
or -amylase inhibitor (7); I , reactive site. Residues identical to in 
CLSL-U are enclosed in solid boxes. 



DISCUSSION 



We have determined the primary structures of two sub- 
tilisin inhibitors from seeds of C lineata. These are the first 

tCrmr+rr ttmn pnVfi'l^tw iTV.r&^._<. £. f 1 » 

4%mmawi vj DUkiutucuu uuuuiuiia iiv;U 4.eg Utile SWUS 01 

which the complete structures have been elucidated. The 
presence of a protein lacking the N-terminal asparagine 
residue was recognized for both inhibitors. Since Canavalia 
seeds contain a significant amount of a protease which 
specifically cleaves at the Asn-X peptide bond (20), such an 
enzyme might participate in the generation of the short- 
chain proteins. 

CLSI-n and -HI are isoinhibitors with different chain 
lengths. Isoinhibitors with an identical core sequence but 
different N-terminals have been found in many legume 
seeds (21, 22). The C tineata subtilisin inhibitors in the 
present work as well as Bowman-Birk type trypsin in- 
hibitors (23) from the same seed showed structural heter- 
ogeneity at their C-termini. The wider inhibitory spectrum 
and higher stability as to extreme pH of CLSMI compared 
to CLSI-IE should be due to the extended peptide chain at 
the C-terminus. However, it is not clear how the elongated 
residues endow the inhibitor with such properties. 

CLSI-II and -EH contain five Cys residues, four of which 



are linked by disulfide bridges. Cys 106 is assumed to be a 
cysteine residue and might be involved in the formation of 
dimeric molecules through an intermolecular disulfide bond 
when the inhibitors are stored for a long period in solution 
(20). 

InFigi4 > vtheanihioa^ 
t^mhibtftOT 
homology with th^ 

^^/The^^ 
these inhibitoiB^T 
residW^iden^^ 

of "CISIr The G ^ ^ 
greatest homology^ 

betweenrG^ 

(tf-S)^torela^^ 

en?yme;6pecifiritie8^ 



^"^t^a^a^fRPraM^ m sweet pototo p& ^f&^ f ^ 
in swee^rry t 
We identified the reactive site of the inhibitor as 
Arg"*Gly M by comparison with other Inhibitors. The 
decrease in inhibitory activity upon modification of the Arg 
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residue(s) and the selective cleavage at this bond of 
CLSI-m by subtilisin support this conclusion. In spite of 
the occurrence of an Arg residue at the t t site, CLSI did not 
inhibit trypsin. Subtilisin has a hydrophobic pocket form- 
ing a subsite, S 4 , and the S 4 -P« interaction was shown to be 
important in the formation of an enzyme-substrate (in- 
hibitor) complex (32, 33). The presence of a bulky hydro- 
phobic residue, Leu, at the P 4 sites of CLSI-II and -HI 
favors the interaction with the hydrophobic S 4 site of 
subtilisin. In contrast, trypsin has only subsite S t -S, and no 
Counterpart of the S 4 pocket of subtilisin (33), Thus, a 
bulky side chain at the P 4 site may interfere with the 
binding of inhibitors to the enzyme. In fact, all known 
Kunitz type trypsin inhibitors bear either Ser or Thr at the 
P 4 site. 

Plots of hydropathy profiles (34) were compared among 
CLSI-m, & caffra DE-3, and STL The;subtilisin inhibits' 
sho^ to ths*tryj^ 

except in the region (re 
site (Kg^l 

secqn4wy^ ii^i^^^^^^W^^i theise Kimits in- 
j^jt^,. irres^^ As 
deicrib^ previously, a ingle cysteine residue (Cyis loc ) of 
CLSI was not involved in the interaction with the enzyme 
(10). U CLSI has a similar tertiary structure to other 
Kunite inhibitors (35), it is deduced that this residua is 
located apart from the reactive site. 
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Supplemental Materials 



MATERIALS AND METHODS 

MfXtrids—lht subtiiisin inhibitors w** * au^ ;r;*H *• 

wy|*mdo-2-*benyOctayl chlor omcthyl ketone (TPCK) 
Achromctocu, tyttcut lysyl ondopepdiasc. Staphylococcus 
^Z us Jl P^asc aad all other chemical w^ ob^iS 
from Wako Cbem. Co. (Osaka), «™«d«i 

Chemical and Proteolytic Digestions — The inhibit i 

Si/.:^ 11 ^ ^-pyridyltrthyUted <PE)-protcins (3 mg each) wet* 

lflf C SiiV7 ,m ^"P** 1 "" cyanogen bromide ia as 
ml of 70 % tonic «. |d n6 ->g * ^ ^J'™ £~ 

o^TL ?«d S^?* 0 i0 2 2o mM ***** <P« 9.0) at 37 X 
NH4HCO3 *t 37 *C for 8 h, A pare of CUMll was also J- 

described above. Trypiic digestion was achieved with TPCK 
trypsin (675=1:50) in 0, 1M NH4HCO3 «37 Xf or 41 F§T 
**k*a were lyophilized. dissolved in 5 M vrca-0 1* 
<rif Iporoacetic acid (TFA). and then fracYion^cd by revere 
phase HPLC on a eiof inc RPC-SC18 column (0 46 1 25 cm 

SSiiusii!* TFA WUh " appfopriate »* * 

j N J** 1 * ™ *5 !hfi *»xtvr* neutralized with 

at 40 c for 48 h. The resulting peptides were separated bv 
rev„se. P na« HPLC on a Cl8 column as descr ib/d"bgV e , 

amino acid compositions of pep-ides wet* determined Bain, 
an am.no acid ae»|y,i» 5y 5 ,em (Jasco) after TydrXis w $ 
0 25« pncnol-2% thioglyeetic acid-«M HCI (74) f £ 20 h at 
1 10 C m evaeuaied sealed tubes. The amiiw acid ,%u«ceT 

^^^^ W*E5a^/» 

« Biofine RPC-SC18B column (4.6 x 250 («m; Jai?), ^ 



tjcaimcnt Of the inhibitors was performed according to the 

Be/ml) lo 0.2 M N -ethylmorpholinc-acetic acid buffer (pH 
£0) fm ; added 200 of 0.3 % pheoylglyoxal in tte » 
buffer After incubation at 25 X for various periods, the 
T l™^! ■jWW^wy activities were measured as described 

Pbenyl^lyoxal was also 

7.6) for 1 ft at 30 C with 68 moI% subtiiisin BPN'(0.9 mg 
u Dm S; V* TC *£™ mijttttrc w « objected to revcrae- * 
phase HPLC on a Biofine RPC-SC18 column* and elation was 

of 0. I wTFA and aeetonitrile (0^5% for 60 min). Th? 
inhibitor el u ted ax 40 mi a was recovered and S- 
jWtfyi«nyUted as described above. After ucess reagent had 
been removed by dialysis against water, the inhibitor 

^iSm^rLV 5 ^^ *? rcvc f*-P*A» HPLC on the same 
column- The C-lCrminal amino acids w C rc identified bv 
carboxypeptidase digestion (18). 7 

Table ns. Amino acid compojitioos of V8 piwease peptides derived from PE>CUWK 



Ammo 



acid 

ASp 

Thr 

Ser 

Obi 

Pro 

Cly 

Ala 

Val 

Met 

Ik 

hen 

Tyr 

Vhfi 

His 

Pecb 

Tip 
Ly» 



VI 



V2 



V3 



V4 



vs 



Vd 



V8 



V9 



VJ0 



1.0(1) 
1-0(1) 



0.9(1) 



0.9(1) 



5.0(5) 1.9(2) 

0.9(0 1*2) 

3.0(4) 10(2) 

13(1) - 

U(l) U0) 

- U(i) 

0.8(1) I 

i.0(i) 



2.60) 1.1(1) 2.7(3) 



i.«(2) 3.2(4) 2.7(3) 



1.6Y2) 



2-6(3) 
1.2(1) 



1.1(1) 23(2) 4.6(5 !•?<!? ^) 
XI (2) 
1^(2) 



2.0(2) 



Total 6 
Position 82-87 



0L9(i) - 



16 



2.0(2) 1^(1) X6(3) 

4.5(5) 1.7(2) 4.0(4) 

t.«U) 1-0(1) . . 

2^(2) 2.7(3) 1.7(2) 3.1(3) 

WI) 1.2(1) 1^(2) Li(l) 3.1(3) 

; ; im W 

0.8(1) 1.0(1) 2_5(3) |.7(2> 1.9(2) 

Z2(2) 2.0(2) - 

1.0(1) 1.8(2) - 
0.6(1) - 

1.0(!) 1.1(1) 3.0(3) . 1.CK1> 

0.9(1) 1.7f2) 00(1^ 

40 *" — 



-U Li. 



19 



27 



*Valnesin 



Amino — — 



17*189 1^ 9.19 88-1Q3 104-143 160-178 3^ fii " 
are taten inis me sequence data. ^.pyfidy|eihyIc>Meinc. 

Table IUS, Amino add compositions of CNBf 
peptides derived from P&CLSi-lI*. 



3.0(3) 
2.0(2) 
1.0(1) 
IJ(I) 
4JS£4) 
3.9(4) 
2.4(2) 
3-2(3) 
1.0X1) 
i0f2) 
2.2(2) 
1.7(2) 



0.8(1) 
0-3(1) 
1.0(1) 

JrfW 



32 



20-31 



Amino 



actd 



Ko K/ii K7T2 



5? 
Thr 

Ser 

Clu 

Pro 

Gly 

Ala 

Val 

Met 

ne 

Leu 

Tyr 

Phe 

Hi* 

Pecb 

Trp 

Lys 



" 1-ioi i.6(i) 

1.0(1) 1.7(2) 

2.70) iXfr) wi) 

•Hi) - 4.2(4) 



K7T3 K7T4 K7T$ 



Mf 



11(1) 0.8(1) 1.0(t) 

;.<xo : 

1.6(2) 

1.9(2) 

1^(2) - * 
1.7(2) 
1.9(2) 

0^(0 - 0.8(1) 



;,w„ 26<3) L6t? > - *-5o 

iffl? I0CI) °« t > ■ 3«4) 

23(2) 23U) 3.2(3) I.l(i) i.Sl) 

1.8(2) 2%) - 

3.3(3) 2,2(2) 2.0(2) 



1.5X2) 0.8(1) 3.7(4) " 



1-3(0 
1.UJ) 



Total 



1.9(2) 0.9(0 0.9(1) 



1J(1> 2.8(3) O^i) iw,) 

2.1(2) 1.0(1) , 1,0(1, 

* * 0.9(1) . 

*.<H0 2^X2) 1.1(1) - 0,9(1) 



<-*C2) 



4.0(4) 

2.1(2) 
1*3(0 
2 *5(3) 2.0(2) 
4.1(4) 2.0(2) 
12(0 1.0(1) 
0,9(1) 33(3) 
09(1) 1.1(1) 
2-2a> 1.2(0 

2.0(2) 
10(0 



9 9 M 

roatwn tt5>03 i1rLi« i/vc .... — -• __j '» i** 77 



0^(1) - 
>0(l) Q.fi(0 
_Q.9(0 



M 



0.9(1) - 

0.5YI) 
11(1) 



3.7(4) 
3.1(3) 
2-1(2) 
13(1) 

0. 6(1) 

1. *2) 
13(0 
2.0(2) 
1^(0 
2.0(2) 
2.1(2) 

3.0(3) 
2.4(2) 



10(1) 0.9(1) 



1.1(1) - 

0.7(1) - 

1.0(1) . 07/11 

14 21 27 



Asp 

Thr 

Ser 

Glu 

Pro 

Cly 

Ab 

Val 

Mett» 

lie 

Leu 

Tyr 

Phe 

K* 

PecC 

Trp 

Lys 



2-6X3) 
0.9(1) 



M2 


M3 


3.2(3) 


ll(l) 


0-9(1) 


0.9(U 


1.6X2) 




1.1(0 




13(2) 




23(2) 


10(0 


1-2(0 




0.6X1) 




2.0(2) 




1.8(2) 


1.0(0 


0.9(1) 











M4 



0.9(1) 



Total 



16 



9.0(10) 



4^(4) 
U^(13) 

a<s( 0 

1 0.4(10} 
7,7(7) 
3.6X4) 

4.0(4) 
4^(5) 
13(2) 

7.0(7> 



[T7r - 1.7 178.100 ^ 
^g^nined as homoscrme plus ktoioserine lactone. 
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Tabic 1V3. Amino acid corapoarions of CNBr peptides demod from P&CLSI-fll. and peptides 
derived OH tiypric digeatran of M4 in Fig. SS* 



acid 

Asp 2.2(2) 
Thf 0.8(1) 



Ml M2 M3 M4Kt M4K2 M4K3 M4K4 M4K5 M4K6 M4K7 M4K8 



Gla 

Pro 

Gly 

Ab 

Vri 

Metb 

lie 

Leu 

Tyr 

Phfi 

His 

fee* 

Tip 

Lyi 

Aft 



1.0(1) 



1.1(1) 



1.0(1) 



3.20) 1.1(1) 

I,6(2> 
t.0(l> 
1.6(25 
1.9(2) 
10(1) 

0*1) 

1.9(2) 

10(2) 
1.0(1) 



2L4G) 
0-5(0 



2-8(3) 
1.1(1) 
0.9(1) 

1.0(1) 



1.0(1) 
0.9(1) 
1*2) 

1.0(1) 



Positton 178-183 



1.9(2) - 



- 1.0(1) 13(2) 

i.7(?) - 

09(1) 0.9(1) 

3.9(4) 2.0(2) 

3.0(3) l.l(l) - 

4.1(4) 5.0(5) - 

1.9(2) l.l(l) - 

1,0(1) 1.9(2) 1.9(2) 

1.0(1) 1.9(2) 0.9(1) 
1.0(1) - 



0.6(1) - 



1.0(1) 2,0(2) 1.0(1) 



16 



1-8(2) 1.0(1) 

1^(2) - 

(MS(1) - 

0.9(1) * 0.8(1) 

1.0(1) 1.0(1) 1.1(1) 
1.0(1) - 



1.7(2) 
3,2(4) 
311(3) 
1.0(1) 
3.0(3) 

2,5(3) 
0.4U) 
1.7& 
2-9(3) 
0.8(1) 
19(3) 
1.7(2) 

1.6(2) 



1.0(1) 3.4X3) 

2.6(3) 1.8(2) 

LOU) 0.9(1) 

2.0(2) 3.1(3) 

2.0(2) 2.1(2) 

3.1(3) 2,0(2) 

IAD 3.8(4) 

35(3) a7(l) 
IW) 1.0(1) 
1.0(1) 
1.8(2) 1.0(1) 

1.1(1) 



15 



11 



1.0(1) 1.0(1) 
3.1(3) 0.9m 1.0(1) 



39 



22 



24 



7 8*23 85-93 130.138 24*38 105-129 94-104 139^77 63-84 39.62 



■Values to panatfhese* are.ttfea (torn tte sequeoee 
tf-Pyridykthyicystcinc. 



data. *>Dettrmin6d &$ ho*o*erf*a plus homoseritu! 



Table vis. Amino edd compositions of tryptfc 
peptides derived from non-reduced CLSMIR 



Araifto aeid 


NT1 


NT2 


NTS 


Asp 


2.1(2) 


3.2(3) 


1.4(1) 


Thr 


0.8(1) 


U(i) 


*0(2) 


Ser 


3-4(4) 


3.6(4) 


1.0(1) 


Glu 


13(0 


4.0(4) 


43(4) 


Pro 




2.9(3) 


1.0(0 


Gly 




1.7(2) 


5.2(5) 


Ab 




1.2(1) 


1.0(1) 


<v> 


1.5(2) 


1.4(2) 


0.7(1) 


Val 




4.1(4) 


2,0(2) 


lie 


1.0(1) 


U(0 


1.6(2) 


Leu 


1.0(1) 


1.2(1) 








14(1) 






1.0(1) 


0.7(1) 


2.4(2) 


His 








Lys 




Z5<3) 


1.0(1) 


Ar« 








Tom 




3) 


25 


Position 


139-154 


41-62 


105-129 



Position of 
disulfide 
_bridge_ 



aVilrcia 



-Cyil49 ^yjBS 



CfreeSH) 



Bthcsca w« UdOen from iho 
i as cysteine, 



AmiDOacid «*¥«iw>M of V8 piocease peptides derived from 
CM -CLSt-lffa. 



AfltoflO 



acid 
Cmc*» 



VI 



V2 



V3 



V4 



V5 



V6 



V7 



V9 



A*p 

Thr 

Ser 

Glu 

Pro 

Gly 

Ala 

Vaf 

Met 

lis 

Leu 

£ 

rfis 
Tip 
Lys 



i.i(i) 

1 10) 



W 

4.9(5) 
1.0(1) 
3.4(4) 
1.2(1) 



3.8(4) 



13(1) - 



t.QCO 1.1(1) - 



30(3) 
1.1(1) 



~ 0-7(1) 1-2(2) - 07(1) - ™ 

1.9(2) 2.0(2) 1.1(1) 3.0(3) 2,9(3) 

2.9(3) 2.5(3) 1.9(2) l.g(2) 

1.5(2) 33(4) Z9(3) 2,1(2) 1.1(1) M(i) 

1.1(1) 2.2(2) 5.0(5) 2,1(2) 22(2) 1.1(1) 

1.9(2) - 2.0(2) 2.6(3) 4.0(4) 1.0(1) 

1.9(2) . 4.6(5) 4.1(4) 53(5} 2,1(2) 

0-9(1) 1.1(0 1.0(1) - 2.1(2) 1.0(1) 

1.8(2) 2.9(3) 2.5(3) 3.0(3) 1.6(2) 

' - - LCK1) 0.9(1) 

1.0(1) 1.0(0 2-1(2) 2.9(3) 1.9(2) 1.0(1) 

1.9(2) 2.0(2) 1.8(2) 

2.7(3) . 

1.8(2) - 



Total 6 
Position 82-87 



■ urn ■ ■ . {jg 

10(0 - - 1.0(0 0.9(0 2.7(3) 

15(2) - 

" " " " 0.7(1) - - 0.7(0 

1,8(2> ' um > 11(0 ^ 1.0(0 1.0(1) 

0-tt» - • - l.K» 0-9(0 1.8(2) 



2.6(3) 
1.9(2) 



ta 



16 



40 



27 



35 



24 



144-15? 1-8 9-19 88-103 104-143 SS81 20.S4 160.18 3 

"Valuta m panyuftoes are taken from toe tequence data. bW^Ao*yt*^ylcysttia*. 



K4 



K2 




K3 



K5/5 



10 20 30 40 SO 60 
Retention Time (rain) 




10 20 30 40 50 



Retention Time (rain) 

2*" Jf ' FlBCtioMI io« » tryptfc digest of peptide K7 in 
sSa wiS ™ LC - ° rt * £olto ™ ° f WO 

ovilo WJtft A Gradient r»f MvtnRiNii. t- n i <v Ti>i 
» - " — *— --- > " v. * rv i * fk. 



so 



40 



30- 



20 1 
o 

10 1 



J- BiocherrL 
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Fig. 3$, Fraciion»<tOT) of a V8 protease digest of PE-CLSl-H 
by reverse- phase HP IX on n column of fiioftne RPC-SC18 
with a gradient of wxtonitrUe in 0.1 % TFA, 



10 20 30 40 SO 60 70 
Retention Time <inin) 





Retention Ti»e (win) 



£?*" 6S./r*«ion*lion of a rryptic digest of peptide M4 In 
Fig. SS by rcver«-phas« HPLC on * column of Biofinc RPO 
SC18 with a gradient of acetonUrik in 0.] % TFA 
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SUMMARY 



The amino acid sequences of the trypsin inhibitors from taro Colocasia esculenta var. esculenta 
and giant swamp taro Cyrtospema chamissonis have been determined and are compared with die 
ff 01 * 111 sequence of the trypsin/chymotrypsin inhibitor from giant taro Alocasia macrorrhiza 
Both inhibitors display polymorphism and there is evidence of two components in the giant 
swamp taro. The positional identity between the proteins is highest at 13-75% for the 
comparison of the giant taro (GT) with the polymorphic forms of the taro (T) inhibitors and 
lowest at 56-58% for the pairs of taro and giant swamp taro (GST) proteins. The comparisons 
show that the inhibitors from T and GT are more related to each other than to GST which 
supports their taxonornic classification into different tribes. Location of the Pi site for the trypsin 
inhibitors of arc-ids is different from that of other Kunitz-type inhibitors and could be at Leu56. 

INTRODUCTION 

Many protease m 

have,been,srudiediOvff* but there has been much less work done on those 

from the Araceae plant family. The latter contains a group of related plants called the edible 
aroids, the most important of which (taro Colocasia and taro Xant kosoma) form the staple diet of 
200 million people living in the tropics (3-5). Taro Xanthosoma sagittifolium is essentially free 
of trypsin and chymotrypsin inhibitors (6), but taro Colocasia esculenta var. esculenta (7,8) and a 
closely related plant, taro Colocasia esculenta var. antiquorum (9-12) contain appreciable 
amounts of trypsin inhibitors. There are three other minor edible aroids of some importance in 

"•To whom all correspondence should be addressed. 
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the Pacific. Of these elephant foot yam (Amorphophallus campanulatus ) has virtually no trypsin 
inhibitor (13), giant swamp taro {Cynosperma chamissonis) contains appreciable amounts and 
giant taro {Alocasia macrorrhiza) large amounts of trypsin inhibitor (8-14). The proteinase 
inhibitor from giant taro inhibits both trypsin and chymotrypsin (7,8,13) and its amino acid 
sequence has been determined (15). The trypsin inhibitors from taro, giant taro and giant swamp 
taro have similar structures. They each consist of two monomers (shown to be identical for giant 
taro (1^ ^ ^0^,. of 

molecular mass ca. 40 kDa from the taro family bind two molecules of trypsin, ie. one per 
monomer (7,15). 

Because the protease inhibitor from giant taro is the only one of the aroid proteins to also 
inhibit chymotrypsin, it is of interest to compare the amino acid sequences of taro and giant 
swamp taro with that of giant taro. Accordingly, in this paper we report the amino acid 
sequences of the trypsin inhibitors from taro (C. esculenta var. esculenta) and giant swamp taro 
and compare them with the known sequence of giant taro (15). 

MATERIALS AND METHODS 
Protein sequencing; Enzyroic and chemical cleavage of the inhibitors and the senararinn «f 



RESULTS 

Amino add sequences: An overlapping sequence of the taro (T) trypsin inhibitor was 
established using tryptic, Glu-C. Asp-N and CNBr cleavages with separation of the fragments by 
HPL -- ^^ence of giant swamp, taro (GST) trypsin inhibitor was determined from tryptic 
chymoiryptic and CNBr cleavages. These sequences together with that of the 
irypsm/chymotrypsin inhibitor from giant taro (GT),preyiously obtained by this laboratory (15) 
ase shown m Fig. I. A limited, number of gaps nave been introduced to maximise positional 
•denmies and similarities, the numbenng,hown is. thatfbr the giant taro (GT) inhibitor. It shouW 
be noted that in our previous pubjicanon (TX.the ^terminal sequences for taro (Colocasia 
esculenta ) and giant swamp taro (Cyrtospemu, chami^ 'nis) were transposed, the correct 
assignment being as in Kg. I. .• • , . 
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Polymorphism: In each case there was evidence of polymorphism. There were only two 
positions in the giant taro inhibitor sequence where polymorphism was detected; a Met or Ala 
occurred in equal abundance at 24, while at position 50 there was Glu or Lys in a ratio of 4:1 in 
the sample subjected to protein sequencing. The successful cloning preferentially isolated the 
Met24 and LysSO forms (A. Matthews et aL, personal communication). In the taro inhibitor there 
were thirteen sites of polymorphism detected in the protein sequencing. These are shown on Fig, 
1 by having in line (Ta) the complete sequence with one of the possible amino acid residues at 
each site of polymorphism, while in line (Tb) only the alternate residues at those sites are 
indicated (eg. He or Val at position 4), the rest of the sequence being as in (Ta). The 
polymorphisms shown occurred in approximately equal proportion, however it is not possible to 
say from the protein sequencing of the various isolated fractions which* combination of amino 
acids occur in any one molecule of inhibitor. The previously reported (7) isoelectric focussing 
profile of the taro inhibitor sample showed five discreet bands plus a diffuse area. It could be 
argued that the sequencing results are consistent with the observed micro heterogeneity. Given 
that seven of the substitutions involve charge differences, (five involving a change between an 
uncharged and a negative charge, one between an uncharged and a positive charge and one 
between a negative and a positive charge) and that the migrating species would be homo- and 
heterodimers (6) there are unlikely to be many different monomer species and there would be 
internal compensation of the charge differences. 

Since the completion of the protein sequencing we have become aware of some work on the 
cloning of taro (Cobcosia esculenta Schott) storage proteins (16). Comparison of the amino acid 
sequence of the protein and that deduced from the two clones (named 5.1 and 10.1) for the 25 
kDa protein clearly indicates that the supposed storage protein is in fact the taro trypsin inhibitor* 
In Table 2 are listed the sites of polymorphism observed in both the protein and cDNA 
sequencing of the taro trypsin inhibitor. For completeness Table 2 includes those positions 
where, while there is no change in the amino acid, there is change in the third base of the codon 
for the amino acid. Also noted are those positions where there is difference between the protein 
and nucleotide derived sequences although no difference was observed within the two sets of 
data. Attention is drawn to those positions in the taro sequences where polymorphism has been 
observed also in our protein sequencing of the giant swamp taro (GST) inhibitor and to where the 
protein sequencing indicates some conservation of an amino acid at a position in the chain. 
Positions where gaps have been introduced in either the protein or cDNA sequences in order to 
facilitate the alignment of the sequences are listed. The numbering used in Table 2 is as in Fig. 1, 
ie. that for the giant taro (GT) inhibitor. In the alignment of GT, T and GST sequences in Fig. 1 
the taro sequence has an additional residue between positions 40-50; this is numbered 46A in the 
protein and it can be noted that a gap is introduced in the nucleotide sequence, while there is a 
compensating insertion in that sequence at 48A to maximise the alignment In the region of 
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positions 128-130 the two clones are different and 10.1 has six extra bases (two amino acids). 
This occurs where, in the alignment of the inhibitors in Fig. 1, gaps have been introduced and 
may correspond to the junction of exons. As noted above, the assignment of amino acids from 
the protein sequencing to lines Ta or Tb was purely arbitrary. Within a clone there is indication 
of a particular sequence of amino acids, however only two clones have been sequenced whereas 
at least five protein species have been observed. Therefore until further clones, which relate to 
variations of the protein sequence, have been studied the assignment of amino acids must remain 
arbitrary. A reallocation of amino acids to lines Ta and Tb on the basis of the limited number of 
clones would not be consistent. Some differences between the two sets of data could be 
expected* due to possible differences in the variety of taio used in each case. 

For the giant swamp taro (GST) the meaning of lines (GSTa and b) is somewhat different from 
lines (Ta and b) of taro* The isoelectric focussing profile of the GST inhibitor had shown a small 
number of bands within a broad diffuse area and during the sequencing it became apparent that 
the sample, as isolated from a trypsin affinity column, consisted of a major and a minor 
component which could be partially separated by reverse phase HPLC From the heights of the 
peaks of the trial HPLC, the two components occurred in a ratio of ca. 4: 1 and they both gave the 
same Af-terminal sequence to twenty residues. The mixture was subjected to enzymic digestion 
and die resultant peptides separated by HPLC and sequenced. The data were interpreted being 
mindful that there were two components with polymorphic variation expected. Within line 
(GSTa) is shown the sequence of a tryptic peptide which occurred as a major peak and resulted 
from hydrolysis after Lys45 and Arg69. In line (GSTb) are those residues which differed in that 
region. The alternate sequence came from a smaller peak (ca, one quarter the height of the major 
peak) which eluted off the Cia column at a slightly lower %B (15). Consistent with normal 
observations trypsin did not cleave the Arg-Pro bond at 54-55 and it appeared that there was 
preferential cleavage by the endoprotease after an Arg-Arg sequence in this peptide, which 
extended that form to position 70. It is feasible that the sequence from 46 to 69 in line (GSTa) 
was derived from the major component and that in line (GSTb). residues 46-70 were derived 
from the minor component. While work is in progress to test this interpretation, it is clear that in 
that region there are two substantially different sequences: the differences occur as a set and do 
not mean that there is multiple polymorphic variation in this region. Hence while there is a major 
difference between the two forms of inhibitor as isolated from the giant swamp taro and this is 
shown in the region 46-70 of lines (GSTa and b), the other sites of difference could be in either 
component and it is noted that some occur at positions where polymorphism has been observed in 
taro. 

Positional identities: In Fig. 1 the underlining in line (GSTa) indicates those residues which are 
common to all forms of the inhibitors from the three aroid species. There are 80 such residues, 
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TABLE L Postitional Identities and Similarities in Trypsin Inhibitors I 

I 

w 

V 

For each pair of inhibitors die Percent of Postitional Identities are listed, also tabulated are the Quality • 
and Percent Similarity between pairs. The data were derived by application of the GCG program "Gap 1 * 
which uses the algorithm of Needtteman and Wunsch (17). The inhibitors are as in Fig. 1: GTa, giant • 
taro; GTb, giant taro polymorphic variant; Ta, taro; Tb, tare variant; GSTa, giant swamp taro; GSTb * 
giant swamp taro variant (see text). * J 



Quality* (top) and Percent Similarity (bottom) 



inhibitor 


GTa 


GTb 


Ta 


Tb 


GSTa 


GSTb 


GTa 


Quality 
% Similarity 


273.3 
98.9 


214.5 
84.1 


208.8 
83.0 


182.2 
75.8 


178.5 
75.8 


1 Identit 


98.9 




213.3 
83.5 


207.1 
82.4 


180.5 
74.7 


176.8 
74.7 


1 Ta 

© 

-»■* 

w 


75.3 


75.3 




261.1 
96.2 


164.0 
71.8 


164.0 
72.4 


p 

t Tb 


73.1 


72.5 


92.4 




156.5 
70.2 


155.7 
70.7 


& GSTa 


58.2 


58.2 


56.4 


52.5 




248.1 
92.4 


GSTb 


56.0 


56.0 


55.2 


52.5 


86.9 





♦Quality = ComparativeValue (CmpVal)^ x Totals + CrapValAB x Total ab + CmpVal A c x Total A c 
GapWe^h^^^^ 



which is 43.5% over a total of 184. The percent positional identity, quality and percent similarity 
for each pair of inhibitors are shown in Table 1. The pairwise comparisons were performed with 
the Wisconsin GCG program 'Gap' which uses the algorithm of Needleman and Wunsch (17) and 
gives values for the quality and percent similarity using a symbol comparison table, with matches 
equal to 1 J and mismatches based on the evolutionary distance between the amino acids (18). 
The values in the line or row for say Ta refer to those for the sequence shown as line (Ta) in Fig. 
1, while those in Tb are derived from line (Tb). Because of the polymorphism and not knowing 
which of the alternate amino acids actually occurs in a particular molecule of inhibitor, the values 
in each case should be viewed as giving a range for the species. There is 92.4% positional 
identity between the sequences shown in lines (Ta and b) and when GTa is compared with each 
of them there is a range of from 73.1-75.3%. The positional identity between the sequences 
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TABLE 2. Polymorphisms in Taro Trypsin Inhibitor 

The amino acids which occur at each site of polymorphism in the taro trypsin inhibitor are shown in 
columns Ta and Tb, which are from the protein sequences found in Fig. 1, and 5.1 and 10.1, which 
refer to the clones isolated by ffirai et al. (16). their nucleotide sequence being translated to amino 
acids. The numbering is as in Fig. 1, Le. for the giant taro (GT) sequence. 



Tb 

5.5 
i.8 

1.7 

1.0 
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Position Ta Tb S.l 10.1 Notes Position Ta Tb 5.1 10.1 Notes 



4 



10 



16 



23 



25 



27 



34 
39 



44 



46 



~46T 



47 



48 



48A 



49 
IT 



63 



64 



72 
75 
80 
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82 



83 



86 
91 
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127 



N 



D 



A 



N 



E 
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128B 
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Jul 



a,d 
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149 



151 
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H 



K 



R 
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R 



K 



N 



M 



K 



a,d 



a,h 



a,d 



a,d 



a,d 



^hangein third base of codon; b, no difference in nucleotide sequence; c, site of polymorphism in 
G^rt^l'to G^l^C^ermSus. hi A in GT and GST; i, S in GT and GST; j, M in GT and 
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shown in lines (GSTa and b) for the giant swamp taro inhibitors is lower at 86.9% and the 
comparisons with GTa gives a range of 56.0-58.2%. There is clearly more positional identity 
between GT and T than GT and GST, The least similar are T and GST, with an overall range of 
only 52.5-56*4% positional identity. The data for quality and percent similarity show the same 
trend with the best values for GT compared with T having an overall range of 207.1-214.5 for 
quality and 82.4-84.1% similarity. The least similar are T to GST, the overall range being 155.7- 
164.0 for quality and 70.2-72.4% similarity. The inhibitors from T and GT appear to be more 
related to one another than to GST, which supports the taxonomic classification of T and GT in 
the tribe Colocasiodeae and GST in the Lasioideae (19). 

DISCUSSION 



We know from our previous studies (7) that the proteins which have been sequenced are 
trypsin inhibitors and naturally a major point of interest is to determine which residues are 
specifically involved in the inhibition, that is to define the Pi site (l).^Pstatediin ? |te 
imwluctie^tl^^ 
diffwnc^n^ 

inhi&tp^^ 
55 *^*on^ 
othe^Kunitz^e*mhiM^ 
whether,^ 

However for the aroid sequence such alignment has Glu61 in the equivalent position to the Pi 
Arg63 residue of the soybean trypsin inhibitor (20). A glutamic acid residue is an unlikely site 
for trypsin or chyraotrypsin inhibition and we noted that the aroid Pi loop region is different from 
that of other Kunitz-type inhibitors. Following from a report of genetic engineering which 
selected a greatly improved neutrophil elastase inhibitor (21) we noted a striking similarity to 

their Sennen£e nf Tte-A1s»-Pfw*-Pfi*>_Pn% ti^ ;~ •u* * i__ . 

■ "~ ^ -~ m ~~ " »"^*v *»*v *o wv x- 1 mw; mc specuic innioition or 

neutrophil elastase. Residues 56-60 of the giant taro inhibitor are Leu-Ala-Phe-Phe-Pro and it 

was argued that with a Uu as the Pi site the inhibitor might bind both trypsin and chymotrypsin 

and that this be the inhibitor site. The current sequencing shows that this region is conserved in 

all three aroid inhibitors and hence throws doubt on the choice of Leu56-Ala57 as the scissile 

bond. If Leu56 were the Pi residue in the giant taro trypsin/chymotrypsin inhibitor, we need to 

postulate that there are sequence driven structural changes in the remaining loop region, which 

prevent the taro and giant swamp taro from having the broader specificity. Future work will 

concentrate on a chemical approach (20) to defining the Pi sites in the amid inhibitors. 
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Abstract — Proteinase inhibitors from cabbage foliage {Brassica oleracea) had trypsin and chymotrypsin inhibitory 
activity that was relatively stable over a broad range of temperatures (0-100°) and pH values (4.5-7.5). The six 
proteinase inhibitors that were purified by affinity chromatography had that ranged from 9000 to 25000, and 
isoelectric points that ranged from 4.5 to 5.0. Separation of these affinity-purified proteins by reverse phase HPLC 
resulted in 14 unique protein species with trypsin and chymotrypsin inhibitory activity. Based on similarities in the 
amino acid content, the HPLC-purified inhibitors were arranged into four groups. 



INTRODUCTION 

Serine proteinase inhibitor proteins are widely distribu- 
ted in the plant kingdom [1], and inhibit the activity of 
enzymes (e.g. trypsin and chymotrypsin) that are com- 
monly responsible for digestion in animals and micro- 
organisms. In addition, these inhibitors have no apparent 
regulatory function in the plant. Thus, they are generally 
thought to contribute to the defence of plants against 
pathogens and herbivores [2, 3]. The most thoroughly 
examined plant serine proteinase inhibitors are found in 
species of Leguminosae, Gramineae and Solanaceae [1]. 
These inhibitors have been classified into at least seven 
non-homologous families based on their amino acid 
sequences [2, 4]. Proteinase inhibitory activity is present 
in the seeds, foliage and roots of cruciferous plants [5-7]. 
However, only a few seed proteins have been character- 
ized [8-10]. Although the proteinase inhibitors in the 
foliage of crucifers have not been characterized, trypsin 
inhibitory activity in cabbage foiiage has been shown to 
be developmentally regulated, appearing in high concen- 
tration in the young foiiage on mature plants [11]. The 
present study describes the purification and partial 
characterization of the trypsin/chymotrypsin inhibitors 
in the young foliage of mature cabbage plants {Brassica 
oleracea L. cv Superpack). 

RESULTS 

Purification of cabbage proteinase inhibitors 

After the initial extraction of proteins from young 
foliage on mature plants, affinity chromatography was 
used to separate the trypsin inhibitors from other con- 
taminating proteins (Fig. 1), and HPLC was used to 
separate the trypsin inhibitors into at least 10 distinct 



peaks of protein that inhibited both trypsin and chymo- 
trypsin (Fig. 2). There was a linear relationship between 
the in vitro concentration of inhibitor and the level of 
trypsin inhibited (y = 0.003 + 0.07x, r 2 =0.982). 

Basing purification on increases in specific activity, a 
67-fold purification of the proteins responsible for trypsin 
inhibitory activity was achieved following affinity chro- 
matography (Table 1). With each purification step 
through affinity chromatography, the ratio of trypsin to 
chymotrypsin inhibitory activity remained the same (i.e. 
ratios were 1.7:1 for crude leaf juice; 1.9:1 for semi- 
purified PI; 1.8: 1 for affinity-purified PI). This suggests 
that the chymotrypsin inhibitory activity originated from 
the same proteins that had affinity for trypsin, and 
therefore were "double-headed 1 inhibitors, as has been 
reported for other plant proteinase inhibitors [1], The 
dual activities were also noted for the inhibitor species 
separated by HPLC (Table 2), although the ratios of 
trypsin to chymotrypsin inhibitory activities differed am~ 
ong protein species. This difference in the ratios of trypsin 
to chymotrypsin inhibitory activities suggests there is a 
difference among the proteins in their binding capacity 
(i.e. number of binding sites, strength of binding) for 
trypsin and chymotrypsin. 

Characterization of cabbage proteinase inhibitors 

Stability of inhibitory activity. Trypsin and chymotryp- 
sin inhibitory activities in the semi-purified proteinase 
inhibitors were stable (100-71 % activity) at high temper- 
atures (100°) in a mildly acidic environment (1 mM HC1) 
for 10-120 min, and were significantly reduced (9-10% 
activity) only when autoclaved. However, the ther- 
mal stability of the trypsin inhibitory activity was signific- 
antly reduced when the inhibitor was incubated at 100° 
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JTab^ Punfication of trypsin inh ibuorsfrom 500 g of cabbage fo.iage 



Purification 
step 

Leaf juice 

Semi-purified 

Affinity-purified 



Total 
activity 
(nig trypsin 
inhibited) 

934 
748 
133 



Total 

protein 

(mg) 

5130 
559 
II 



Specific 
activity 

0.18 
1.34 
111 



Recovery Purification 
(%) (fold) 



100 
80 
14 



I 

7 
67 
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Tabic 2. Inhibitory activity and M r or H PLC- purified cabbage trypsin/ 
chymotrypsin inhibitors 



HPLC 


mg try p. inhib. 


mg chyrno. inhib. 






fraction no. 


(mg protein) 


(mg protein) 


Protein 


M r xl0" 3 


6 


11 


4 


la 


12 








lb 


9 


14 


3 


0.5 


II 


10 


18 


5 


4 


III 


10 


31 


0.2 


0.5 


IV 


11 


39 


6 


6 


Va 


12 








Vb 


9 


47 


0 


1 


VI 


10 


74 


5 


0 


VII 


25 


77 


3 


1 


Villa 


24 








VHIb 


11 


83 


1 


0.4 


IXa 


24 








IXb 


11 


86 


4 


2 


X 


22 




PH 

Fig. 3. Thermal and pH stability of cabbage proteinase in- 
hibitors. A sample of semi -purified cabbage proteinase inhibitor 
(2 nig ml' 1 1 mMHCl) was incubated in I ml of citrate phos- 
phate (triangles), sodium phosphate (circles), or glycine- 
NaOH (squares) buffer, at 0° (dashed line) or 100° (solid line), for 
10 min, then tested for trypsin inhibitory activity. 

protein bands during gel electrophoresis of affinity-puri- 
fied trypsin inhibitors, and/or biochemical modification 
of the proteins (e.g. glycosylation, de-amidation) that 

TV»cn1t*»H in cpnorotP *>liifirtn frr\m tK*» MPT (~* rnlumn KA c 

r — r* 

of the HPLC-pu rifled protein species are indicated in 
Table 2. 

Amino acid analysis. To ensure that the amino acid 
analyses for the immobilon-transferred proteins were as 
accurate as the proteins that were analysed directly, two 
protein species (fractions 18 and 77) were analysed by 
both methods. The results (data not shown) indicated that 
immobilon-transfer of protein did not introduce signific- 
ant errors in the amino acid analysis. The amino acid 
analyses (Table 3) were organized into four groups based 
on similarities in amino acid content. 

DISCUSSION 

Standard procedures (i.e. ammonium sulphate pre- 
cipitation of aqueous extract, followed by affinity chro- 



matography) were used to extract and purify six trypsin 
inhibitors from cabbage foliage. These inhibitors were 
relatively small (M, 9000-25000), acidic (pi 4.5-5.0) 
proteins that were stable over a relatively broad range of 
pH values (4.5-7.5) and temperatures (0-100°). In general, 
these inhibitors had at least two active inhibitory sites: 
one specific for trypsin, the other specific for chymotryp- 
sin. There was a difference between the number of trypsin 
inhibitors from non-chlorophyll-containing foliage (i.e. 
inner cabbage head) compared with chlorophyll-contain- 
ing foliage. The inhibitors from the chlorophyll-contain- 
ing foliage included two additional small, acidic proteins 
that were not present in the non-chlorophyll-containing 
foliage. At this point it is unclear whether the two 
additional proteins are found in green foliage, or are an 
experimental artifact resulting from proteolysis, or some 
other post-extracttonal biochemical modification. How- 
ever, these proteins were consistently detected in each 
affinity-purified extract of young green foliage on mature 
plants, while they were absent from all extracts of white 
foliage. 

The trypsin/chymotrypsin inhibitors from cabbage 
foliage are distinct from the serine proteinase inhibitors 
isolated from seeds of other crucifers, which are trypsin 
specific, basic proteins with unique amino acid contents 
[9, 12, 13]. 

The cabbage trypsin inhibitors were divided into four 
groups based on amino acid analyses (Table 3). The 
protein species in Group A were clustered, based on their 
large number of alanine and valine residues per mol of 
inhibitor, while the protein species in Group D were 
linked, based on the large number of arginine residues per 
mol of inhibitor. The four groups of inhibitors were also 
distinguished by the average per cent of hydrophobic 
amino acid residues: (A) 36± 1.8%, (B) 31 ± 1.1%, (C) 14 
±3.0% and (D) 1.6 ±0.1%. However, amino acid se- 
quences of these protein species are needed to determine 
the genetic relationship between these inhibitors. 



24 



R- M. Broadway 




strips (0.88 M Alanine. 0.25 M Tris, pH 8 « Lane a Lm^ 8 8 ' ^ W " h "o^^ng buffer 

P-naseinh^ 

inhibi.ory activity) for bovine trypsin and chymotrypsin 
! 1 2) ' Perhaps this variability reflects a potential for 
inhibition of a number of different serine proteases), from 
different species of herbivorous insects and/or plant 
pathogens. 



EXPERIMENTAL 

Plants Seeds for the cabbage cultivar Superpack 
(Johnny s Selected Seeds, Albion, ME) were germinated 
tn Cornell Mix [14] in 7.6-1 plastic pots. The seedlings 
were thinned to one plant per pot, and maintained in a 
greenhouse under 1 kW metal halide lamps (12 hr light- 
12 hr dark) at 28°. The plants were watered three timesV 
week, and fertilized once a week with a water-soluble 
nutrient mix (I6N-32P-16K). 

fny™ assays. A standard spectrophotometry assay 
[15] was used to determine the presence of trypsin 
inhibitory activity in the cabbage extracts and purified 
iracuons. bovine trypsin (type HI, Sigma) was 60% active 
as determined with the active site titrant, p-nitrophenyl- 
P-guan.dmobenzoate [16]. Bovine trypsin (0.1 mgml- 
1 mM HCI) was mixed (1:1, v/v ) with the plant proteinase 
.nh.b.toi : (2 mgml-'), and incubated at room temp, for 
10 mm. Then 100 jd of the mixt. was added to 2.9 ml of 
buffer (0.05 M Tris, pH 8.0) containing 1.04 M p- 
toluene-sulfonyl-L-arginine Me ester. Trypsin activity 
was monitored at 247 nm for 3 min, and compared with 
the activity of 50 /d of uninhibited trypsin. A micro- 
technique was used to measure trypsin inhibitory activity 
m affinity- and HPLC-purified proteins. Fractions were 
equilibrated based on protein content, then a 24-hI 
aliquot of the sample was mixed (1 : 1, v/v) with trypsin, as 
described above. The mixt was added to 500 /il of 
substrate, and analysed for trypsin activity as described 




Fig. 5. Polyacrylam.de gel electrophoresis of affinity-purified 
cabbage proleinase inhibitors. The gel consisted of 10% poly- 
acrylamide. pH 8.8, with a 4% polyacrylamide stack, pH I 6.8 
1.5 mm thick The left portion of the gel was stained with 
Coomassie, while (he right portion of the gel was incubated in a 
trypsin solution, then stained with acetyl-phenylalanine-*- 
napnthyl-ester, a trypsin-specific chromophoric substrate. 

As in other plant families, such as Solanaceae and 
L*gum,nosae. the Cruciferae (e.g. cabbage) contained a 
large number of inhibitor species. Based on the fact that 
these inhibitors have differential binding capacity (i e 
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Fig, 6. SDS-PAGE of affinity-purified cabbage proteinase inhibitors. A discontinuous polyacrylamide gel (7.5% 
stacking gel. 20% separating gel) was used with buffer strips containing 0.2 M Tricine, 0.2 M Tris, 0,55% SDS, pH 
8.1. Lane a contained white proteinase inhibitors, lane b contained M r markers, and lanes c-e contained green 
proteinase inhibitor, at concentrations of40, 20 and 10 /ig per lane, respectively. The M r markers were 66, 45, 36, 29, 
24, 20.1 and 14.2 x 10 3 , while the proteinase inhibitors were determined to be 22-24, 17, 15, 12.5, 1 1 and 10.5 x 10 3 
for the green proteinase inhibitors, and 21-24, 17, 15 and llx 10 3 for the white proteinase inhibitors. 



Table 3. Amino acid analyses of HPLC-purified cabbage trypsin inhibitors 



Group A Group B Group C Group D 





A' 




A" 




B' 




B" 




C 


C" 




C" 






Protein 


IXa 


VII 


Villa 


X 


II 


III 


Va 


la 


Vb 


VHIb 


IXb 


lb 


IV 


VI 


M r xlO" 3 


24 


25 


24 


22 


10 


10 


12 


12 


9 


11 


11 


9 


11 


10 


AA* 






























Asx 


17.5 


18.7 


18.6 


18.0 


9.8 


9.6 


12.9 


10.8 


6.7 


6.1 


7.5 


4.7 


5.4 


5.4 


Glx 


2Z6 


21.4 


21.3 


20.0 


8.0 


8.2 


9.5 


10.7 


9.2 


10.6 


11.8 


6.9 


123 


11.6 


Serf 


22.0 


16.9 


16.1 


14.3 


2.9 


2.9 


5.0 


9.8 


5.2 


3.7 


2.9 


3.1 


0.2 


0.1 


Gly 


44.8 


43.5 


27.8 


26.1 


11.5 


12.1 


15.2 


16.1 


15.8 


19.0 


18.9 


51.5 


19.8 


17.6 


His 


1.6 


0.7 


0.3 


0.1 


1.0 


0.8 


1.5 


0.7 


0.9 


0.7 


0.5 


0.5 


0.1 


0.1 


Lys 


12.2 


12.5 


12.7 


129 


6.5 


7.8 


7.9 


7.6 


5.0 


5.5 


6.2 


4.2 


6.0 


5.4 


Arg 


14.5 


13.7 


14.1 


13.6 


6.1 


7.6 


6.7 


9.3 


13.3 


21.7 


24.3 


127 


31.7 


28.4 


Thrt 


11.3 


1 1.4 


12.1 


11.1 


3.6 


4.2 


5.2 


3.0 


5.6 


8.7 


9.6 


4.8 


12.8 


11.5 


Cyst 


1.0 


0.0 


1.2 


3.0 


8.5 


7.4 


4.1 


4.2 


0.0 


0.0 


0.9 


0.9 


5.0 


1.2 


A!a 


13.! 


!Z! 


1 2.2 


in 5 


2.7 


2.9 


A A 


Aft 


2.5 


1.8 


1 Q 


i 1 


0 2 


A f 

v.* 


Pro 


12.7 


15.2 


16.5 


17.1 


7.0 


6.1 


7.1 


4.2 


1.6 


2.0 


0.7 


1.0 


0.1 


0,1 


Tyr 


8.4 


12.5 


9.9 


5.2 


3.8 


4.1 


6.1 


8.5 


5.1 


3.9 


2.8 


4.2 


0.0 


0.0 


Val 


13.7 


17.0 


17.5 


16.8 


5,4 


5.5 


5.1 


3.5 


2.0 


2.3 


1.1 


1.0 


0.2 


0.2 


Metf 


0.3 


0.0 


0.1 


0.9 


0.2 


0.1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.6 


0.5 


lie 


9.6 


11.3 


11.3 


9.9 


2.7 


2.5 


25 


3.5 


1.8 


1.7 


1.2 


1.1 


0.2 


0.1 


Leu 


14.5 


14.5 


14.3 


13.0 


4.1 


4.1 


6.2 


4.8 


3.7 


4.1 


2.2 


1.7 


0.2 


0.1 


Phe 


11.5 


14.8 


15.2 


14.1 


5.3 


5.9 


6.8 


4.8 


2.1 


1.0 


0.0 


0.6 


0.2 


0.2 



•Amino acid content expressed as residues mol" 1 of inhibitor. 
fAcid hydrolysis is destructive to these amino acids. 



above. A 12-^1 aliquot of trypsin was used to measure 
uninhibited trypsin activity. 

Chymotrypsin inhibitory activity was determined by 
mixing the plant extract or purified sample (at the same 
concentrations indicated above) 1 : 1 (v/v), with TLCK- 



treated bovine chymotrypsin (0.1 mgml" 1 1 mMHCl) 
for 10 min at room temp. Then, 80 /*1 of the mixt was 
added to 2.9 ml of substrate [1 mM benzoyl-L-tyrosine Et 
ester in 5% MeOH, mixed 1:1 (v/v) with 0.05 M Tris, 
pH 8], and monitored at 256 nm for 3 min [17], The results 
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£L\ h . " °l M * ° f Cabba « e P ro,einase int »™°* by 
l.qu.d chromatography on Sephadcx G-75. The open square 

.nd,ca, e pro,e,n elution; ,he closed circles indicate tryr*in inhib- 
itory activity. The M, markers eluted as followTcarbonfc 

(20.1x10) in fracdon 12 and cytochrome c (12.4x10') in 
fraction 16. 



were compared with the activity of 40 pi of uninhibited 
chymotrypsm. The chymotrypsin inhibitory activity in 
affin.ty- and HPLC-purified proteins was measured by 
the micro-technique described for measuring trypsin in- 
hibitory activity. 

Protein determination. Protein concn was estimated 
using bicinchoninic acid reagent [18]. Purified protein 
irom cabbage foliage was used as the standard. 

Purification of proteinase inhibitors 

fraction. Proteinase inhibitors were extracted from 
mature cabbage plants, using either fresh, young foliage 
mcludmg the cabbage head and surrounding green 
leaves) or only the white foliage of the cabbage head A 



single extraction consisted of homogenization of 500 e of 
fo .age m 500 ml of ice-cold, 0.1 M Tris-HC. buffer^ 7 
01 /« ascorb IC acid (w/v). The homogenate was sauL^w 
through a double layer of cheesecloth 8 and ,L i^ 

at 4^f„r , h SUp ? rna,ar " at 80% «tn. and incubated 
at 4 for 1 hr w,th st.rr.ng. After incubation, the mixt 
was centnfuged a. 6000 4°, for 20 min, then he X 

r 2 a ^,s ded in H ^ and diaiysed 

12000-14000) against HjO at 4". The dialysate was 
centnfuged, and the supernatant was lyophilLd. This 
preparation was called semi-purified proteinase inh £ 
tors. The matenal that originated from total young 

ouriL 0 " ma,Ure P, u mS W8S deS, ' 8nated '^een semi 8 
punfied protemase inh.bitors', and the material from the 
white fol age ,n the cabbage head was designated 'white 
semi-punfied proteinase inhibitors' 

Affi ni y< hroma t°<>™Phy- Semi-purified proteinase in- 
lubitor (100 mg 50 ml " ' 0.01 M Tris, P H 8 1, 0. 1 M KC3) 
was apphed to an affinity column (bovine t^psin bound 
to cyanogen bromide-activated Sepharose 4B 15 
w r o L 8 ! 4 ' 1116 C0lumn was washed w «h 0.01 MTris 

So ii ,? Kai until ,he A * 280 nm 

ml 2? tk P /° te,naSe inhibit0r was eluted wit " 8 M 
r,^n P \ CUOnS fr ° m the entire P rotei « Peak (A 

ryophiS. Were ^ dia ' ySed a8ainS, H *° at ^ 

Characterization of cabbage proteinase inhibitors 

Polyacrylamide gel electrophoresis. The PhastSystem 
etorophores,* unit (Pharmacia) was used to deteLn" 

he number of protein bands, M, and isoe.ectric point of 
the affin.ty-pur.fied trypsin inhibitors. The PhasSystem 
was also used to determine the M r (20% disconSou" 
polyacrylamide PhastGel, with Tris/SDS buffer stripTof 
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4.55 
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HPLC-purificd trypsin inhibitors (a 5-/*g aliquot of each 
protein). Each test sample was dried in a Speed-Vac, 
resuspended in 3 fi\ deionized, distilled H 2 0+1 ^1 4x 
sample buffer [40 mM Tris, 4 mM EDTA, pH 8.0, 20% 2- 
mercaptoethanol, 0.04% bromophenyl blue (w/v)], then 
placed in boiling water for 5 min prior to SDS-PAGE 
analysis. 

To determine the purity of the affinity-purified trypsin 
inhibitor, a 75-/il aliquot of a 4 mg ml" 1 soln of affinity 
purified trypsin inhibitor was applied to a vertical, dis- 
continuous, non-denaturing, 1.5 mm, 10% polyacryl- 
amide gel, pH 8.8, with a 4% polyacrylamide stacking gel, 
pH 6.8. The reservoir buffer was 0.025 M Tris, pH 8.3, 
0.192 M glycine. Following electrophoresis, the gel was 
stained with Coomassie Brilliant Blue R (20% MeOH, 
30% HOAc, 0.1% Coomassie), and destained with 30% 
MeOH, 10% HOAc to detect all protein bands. To detect 
protein bands with trypsin inhibitory activity, a modifica- 
tion of the method from ref. [19] was used. The poly- 
acrylamide gel was washed four times with 30% MeOH, 
20% HOAc to fix the protein, then rinsed with H 2 0 
twice. The gel was equilibrated overnight in 0.1 M NaPi 
buffer, pH 8, then incubated in a trypsin soln (0.1 mg 
trypsin ml" 1 0.1 M NaPi buffer, pH 8) for 30 min at 37°. 
The gel was rinsed twice with H 2 0, then covered with a 
freshly prepared soln of 2.5 mg acetyl-phenylalanine-j?- 
naphthyl-ester (APNE) in 1ml dimethylformamide 
+ 9 ml 0.55 mgml 1 tetrazotized o-dianisidine, in 0.1 M 
NaPi, pH 8. The gel was incubated in the APNE soln for 
30 min at 37°, then rinsed with H 2 0. Digestion of APNE 
by trypsin resulted in the release of a red chromophore. 
Clear bands indicated the inhibition of trypsin. A lane 
containing BSA (no trypsin inhibitory activity) was run as 
a control. 

Amino acid analysis. The amino acid analysis of each 
HPLC-purified protein was performed at the Cornell 
Amino Acid Facility on the Waters Pico-Tag HPLC 
System [20]. HPLC fractions containing a single protein 
(as determined by SDS-PAGE) were analysed for amino 
acid composition directly as follows. A 10-//g aliquot of 
each fraction was dried in a Speed-Vac, acid hydrolysed 
(6 MHCI, 95 min, 150°, under partial vacuum and N 2 ), 
then analysed for amino acid composition. HPLC frac- 
tions containing more than one protein band were elec- 
u ophoresed, to separate the proteins prior to amino acid 
analysis. A 10-/*g aliquot of each fraction was dried in the 
Speed-Vac, electrophoresed on SDS-PAGE, as described 
above (PhastSystem, Pharmacia), then the proteins were 
electrophoretically transferred to PVDF transfer mem- 
brane (Immobilon-P, 0.45 urn, Millipore) using the Phast- 
System Transfer Unit and Tris-glycine transfer buffer 
(25 mM Tris, 192 mM glycine, pH 8.5, 20% MeOH). The 
Immobilon was stained with Coomassie (0.1% Coomas- 
sie, 60% MeOH, 7.5% HOAc), destained with 75% 
MeOH, 7.5% HOAc, then exhaustively rinsed with de- 
ionized, distilled H 2 0, to remove excess glycine origin- 
ating from the transfer buffer. Each protein band was acid 



hydrolysed (6 M HCI, 115 min, 150°, under partial 
vacuum and N 2 ), the amino acids were extracted from the 
Immobilon, then analysed for amino acid composition 
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